IN THE UNITED STATES 
PATENT AND TRADEMARK OFFICE 



In re patent of: 

Thomas Sean Houlihane 

Appl. No.: 10/743,473 (Pub. No.: US 2005/0004777) 

Publication Date: Jan. 6, 2005 - Applicant's amended claims dated: January 22, 2007 
For: GENERATION OF A TESTBENCH FOR A REPRESENTATION OF A DEVICE 

Submission of Prior Art Under 37 CFR 1.501 

To: : 

Mail Stop M Correspondence 
Hon. Commissioner for Patents 
P.O. Box 1450 
Alexandria, VA 22313-1450 



Sir or Madam: 

The undersigned herewith submits in the above identified patent the following prior art 
(including paper copies and electronic duplicates thereof) which is pertinent and applicable to the 
patent and is believed to have a bearing on the patentability of at least claims 1, 3-25, 27-46: 

1 . A copy of presentation slides entitled: Svstem-on-chip ( SOQ and its Effect on 
Microcomputer Bus Products , as presented to the VMEbus Standards Organization 
(VSO) on July 21, 1999 in Vancouver, B.C. Canada. 

The document provided is original except for minor formatting changes required to print 
it out using a current version of Microsoft PowerPoint, and changes to the copyright 
notice allowing unlimited copying and distribution. 31 pages. 

2. A copy of a technical reference manual entitled VME64 to PCI Bridge S vstem-on-Chip 
(SoQ, first published by Silicore Corporation on December 7, 2002. 

The document provided had changes made to the copyright notice and company address 
in 2004 (see p. 4). 129 pages. 
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3. A copy of the VHDL source code files representing the VMEcore entity and testbench, as 
produced by Silicore Corporation's parametric core generator known as The VMEbus 
Interface Writer™ on 14 Feb 2002. 

Permission is hereby granted by Silicore Corporation to disregard the copyright notice 
listed at the top of each file page and substitute it with: "Copyright © 2002 Silicore 
Corporation. This document is provided under the terms of the GNU Lesser GPL 
License 2.1 as published by the Free Software Association." 46 pages. 

4. A copy of the VHDL source code files representing the VMEcore entity as distributed 
with the VME64 to PCI Bridge Svstem-on-Chip (SoC\ and described at § 4. 73 VHDL 
Synthesis and Test (p. 81) in the technical reference manual. 31 pages. 

5. A copy of WISHBONE Svstem-on-Chip (SoQ Interconnection Archite cture for Portable 
IP Cores , Revsion B.2, dated: October 10, 2001. 109 pages. 

6. A copy of a press release from Altium Ltd. (Sydney, Austrailia) dated 28 Apr 2003 
entitled: Altium Unveils new 'Board-on-Chip* Technology . 3 pages. 

This file was downloaded on 21 Mar 2007 from URL: 
http://www.altium.com/files/corp/media/pdfs/mr_corp_280403.pdf 

7. A copy of a press release from Altium Ltd. (Sydney, Austrailia) dated 1 7 Nov 2003 
entitled: Altium Introduces Systems Focus to FPGA Design with Nexar . 5 pages. 

This file was downloaded on 21 Mar 2007 from URL: 
http://www.altium.com/files/corp/media/pdfs/mr_171103.pdf 



Each of these references discloses a method for the GENERATION OF A TESTBENCH FOR A 
REPRESENTATION OF A DEVICE. It is believed that each of these references fall within the 
scope of Houlihane (as defined by the specification), and therefore has a bearing on the newly 
proposed claims. 



Item 1, Svstem-on-chip (SOO and its Effect on Microcomputer Bus Products , is a presentation 
used to publicly announce VMEcore™, an IP Core that operates as a bus interconnect block 
advantageously interfacing the VMEbus microcomputer bus to WISHBONE, a System-on-Chip 
residing inside of a semiconductor integrated circuit. With respect to this document: 

a) The announcement starts on page 19 of the presentation. VMEcore is a representation of 
a device, written in the VHDL hardware description language, which is incorporated into 
a data processing apparatus. 



THIS PAGE BUNK 



(USPTC) 



(Page 3 of 6) 



VMEcore is created by a software processing tool called The VMEbus Interface 
Writer™ , a parametric core generator that: (i) receives configuration data options in the 
form of a machine readable text file, (ii) autonomously generates a VHDL testbench that 
makes reference to the configuration data, and which includes a first set of templates, 
known as test vectors defining inputs and outputs in a test environment and (iii) 
autonomously generates a representation of the device with reference to the configuration 
data in the form of a second set of templates including in the form of one or more 
synthesizable VHDL files. 

At p. 24 in the presentation, The VMEbus Interface Writer was demonstrated to the 
public audience at the July 21, 1999 VSO meeting, and included: (i) the examination of 
all input and output files and (ii) operation in 'batch mode' where the tool is operated in 
combination with other tools as defined in a script file. This demonstration represents the 
last possible critical date for first public use and first offer for sale of this technology. 

The internal operation, methods for combining IP cores and test elements, and related 
documentation for The VMEbus Interface Writer is a trade secret of Silicore Corporation, 
and have never been publicly displayed. 

Item 2, VME64 to PCI Bridge System-on-Chip (SoCV is the technical reference manual for a 
VMEbus to PCI bus bridging System-on-Chip that uses a VMEcore™ interface as created by 
The VMEbus Interface Writer™ and which was successfully reduced to practice. With respect 
to this document: 

a) The detailed operation of the interface is described in § 4.7 VMEcore™ Entity (p. 81- 
101). 

b) A block diagram showing the operation of VMEcore with respect to other system 
elements is shown on p. 5. 

c) This SoC was created with U.S. Government funds, with original documents available to 
U.S. Government employees under the contract number shown on p. 2. It was originally 
developed and sold directly to the U.S. Army Crusader project, a 155 mm self-propelled 
Howitzer cancelled by the Bush Administration in 2002. The Crusader has similar 
features to Archer, a 155 mm self-propelled Howitzer made by BAE Systems / Bofors 
(Sweden). 

Item 3 is a compilation of VHDL and text files autonomously submitted to, and obtained from, 
The VMEbus Interface Writer in order to create a custom copy of VMEcore used in the system 
described in Item 2. With respect to these files: 

a) File <SlavTest.txt> contains configuration data supplied to The VMEbus Interface Writer 
which describes the precise parameters that it should use to create the customized 
VMEcore output files, including a representation of the device. For example, the line 
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specifying "SLAVE: A24:SGL" indicates that the tool should create a VMEbus interface 
that supports the A24 addressing mode. Similarly, lines beginning with a character are 
commented out. 

b) File <SlavTest.log> is an output file indicating that the batch operation worked 
successfully, and includes a time and date stamp. 

c) File <VmeCoreT.vhd> is an output file which is a testbench that makes reference to 
configuration data near the comment field marked "Interface:" (e.g. "A24:SGL"). The 
body of the file contains a first set of VHDL templates which define the test environment, 
including the ability to read a test vector file located in file <VmeCoreV.txt>. This 
operation is performed using a variety of standard, IEEE compliant VHDL simulation 
tools available from a number of suppliers. 

d) The remainder of the files have a \vhd' file extension indicating that they are VHDL 
files, and which are advantageously arranged according to the hierarchical diagram 
shown in Figure 4-29 (p. 89) of the technical reference manual listed in Item 2. This 
second set of templates each make reference to the configuration data near the comment 
field marked "Interface:" (e.g. "A24:SGL"). 

Item 4 is a compilation of files included with the standard distribution of the VME64 to PCI 
Bridge Svstem-on-Chip (SoQ . With respect to these files: 

a) This compilation of files was adapted from the standard VMEcore output described in 
Item 3 above to: (i) include a 'DDIR' signal which was requested by the customer on or 
about 1 Mar 2002; (ii) by changing the copyright notice on 17 Jan 2004; and (iii) by 
eliminating the testbench and test vectors files which were not needed for the high level, 
hierarchical bridge SoC design (a separate file, not shown here, is included for that 
purpose). 

b) These changes were implemented by modifying the output files rather than The VMEbus 
Interface Writer tool itself. This prevented the use of U.S. Government funds when 
modifying the tool, thereby preventing it from being subjected to U.S. Government 
Purpose Rights under DFARs 252.227-7014 (JUN1995), including inspection 
requirements thereof. 

Item 5, WISHBONE Svstem-on-Chip (SoC) Interconnection Architecture for P ortable IP Cores, 
Rev. B.2, defines the interconnection system advantageously used by the aforementioned 
methods. With respect to this document: 

a) From the Introduction on p. 7: "This specification does not require the use of specific 
development tools or target hardware. Furthermore, it is fully compliant with virtually all 
logic synthesis tools. However, the examples presented in the specification do use the 
VHDL hardware description language. These are presented only as a convenience to the 
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reader, and should be readily understood by users of other hardware description 
languages (such as Verilog®). Schematic based tools can also be used." 

b) A specific objective of the WISHBONE SoC indicated on p. 10 is: "A further objective 
of the specification is to be independent of the underlying hardware description. For 
example, soft cores may be written and synthesized in VHDL, Verilog® or some other 
hardware description language. Schematic entry may also be used." 

c) A specific objective of the WISHBONE SoC indicated on p. 1 1 is: U A further objective is 
to create an architecture that fully supports the automatic generation of interconnection 
systems. This allows the WISHBONE interconnection to be generated with parametric 
core generators." 

d) From the Glossary on p. 21 : "Parametric Core Generator. A software tool used for the 
generation of IP cores based on input parameters. One example of a parametric core 
generator is a DSP filter generator. These are programs that create lowpass, bandpass and 
highpass DSP filters. The parameters for the filter are provided by the user, which causes 
the program to produce the digital filter as a VHDL or Verilog® hardware description. 
Parametric core generators can also be used create WISHBONE interconnections." 



Item 6, Altium Unveils new 'Board-on-Chip' Technology , announces a technology known to 
those skilled in the art as being based upon WISHBONE (SoC) interconnection technology. 
With respect to the features of this tool listed on page 2 in this document: 

a) "Mixed schematic/VHDL design capture" 

b) "Processor core packs that combine pre-synthesised processor cores with matching 
compiler, simulator and debugger" 

c) "Integrated software development" 

d) "'Virtual' instruments such as logic analyzers and frequency counters that can be built 
into the design for test purposes" 



Item 7, Altium Introduces Systems Focus to FPGA Design with Nexar , announces a technology 
known to those skilled in the art as: (i) being based upon WISHBONE (SoC) interconnection 
technology and (ii) further advancing the art of parametric core generators and test benches used 
in the product described in Item 6. 
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Respectfully submitted by: 




Wade D. Peterson 



Certificate of Service 



I hereby certify on this 22 nd day of March, 2007 that a true and correct copy of the foregoing 
"Submission of Prior Art" was mailed by first-class mail, postage paid, to: 

Nixon & Vanderhye, PC 
John R. Lastova, Attorney at Law 
901 North Glebe Road, 1 1 th Floor 
Arlington, VA USA 22203-1808 * 
Tel: 703-816-4000 



Wade D. Peterson 
Silicore Corporation 
3500 Vicksburg Ln N. #110 
Plymouth, MN 55447 
TEL: 612-388-9755 



Signed: 
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1.0 Introduction 



The VME64 to PCI Bridge is a System-on-Chip that allows data transfer between a VMEbus 
slave interface and a PCI target interface. It is delivered as a VHDL soft core module that is in- 
tended for use on a Xilinx Spartan 2 FPGA. However, with minor modifications this soft system 
can be implemented on other types or brands of FPGA and ASIC devices. Figure 1-1 shows a 
functional diagram of the bridge. 
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BUF D 
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BUF F 



BUF_H 



<1— 1> 

0—0 
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"<r— 0 

"<3— > 



El 
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c 



^ Control ^ 
N Tin v * 



WCLK 



^4 



WCLK 



WRST 



Q 



FPGA or ASIC Target Device 



pa 



AD[31::00] 



C7BE(3::0] 



PCUC 
33.333 MHz 



V 



N_ConfSerEnl I 
ConfSerDat > | 
ConfSerOk J> 



WatchdogFaultDSP 



-> SPARTAN 1 



-> N_VACFailB 



07 DEC 2002 



Figure 1-1. Functional diagram of the VMEbus to PCI bridge. 
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1.1 Features Of The Bridge 

• VMEbus interface features: ] 

- Silicore VMEcore™ IP Core technology 

- A24:D32:D16 slave interface 

- Posted read and write capabilities 

- Synchronous interface design 

- Conforms to ANSI/VITA 1-1994 

• PCI interface features: 2 

- Xilinx LogiCORE™ PCI IP Core technology 

- D32 target interface 

- 0 - 33.333 MHz operation 

- Conforms to PCI Revison 2.2 

• Shared memory features: 

- Nine, 256 x 32-bit shared memory butters 

. Semaphore control register for each buffer (except BUF H) 

- Buffers use independent memory arbitration 

. Serial EEPROM interface for Atmel AT17 Series of FPGA Configuration ROM. 

. Internal WISHBONE bus features 3 : 

- Conforms to WISHBONE Revision B.2 

- Simple, compact, logical hardware interfaces 

. Straightforward and highly reliable synchronous design. 

. Written in flexible and portable VHDL hardware 

ne „ t s (with the exception ^"^'-^ LppUed. This a.lows 
TJZZ nTin™ SEE design. Complete d— ion is a,so pro- 
vided. 



^ JSSLom and the Xi.inx LOGlcore PCI Design Gu.de. 

A-e~A fr n m onrce code in the public domain. Open source 
3 The interna, WISHBONE SoC ^^^^^Z^^ Service Center at 
WISHBONE SoC components are available on the web rrom ine w 
www.silicore.net/wishbone.htm or from www.opencores.org. 
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1.2 Glossary Of Terms 



Ox (numerical prefix) 

The 'Ox' prefix indicates a hexadecimal number, 
in the 'C programming language. 



It is the same nomenclature as commonly used 



Active High Logic State 

A logic state that is 'true' when the logic level is a binary T (high state). The high state is at a 
higher voltage than the low state. 

Active Low Logic State 

A logic state that is 'true' when the logic level is a binary '0' (low state). The low state is at a 
lower voltage than the high state. 

ASIC 

Acronym for: Application Specific Integrated Circuit. A general term which describes a generic 
array of logic gates or analog building blocks which are programmed by a metallization layer at a 
silicon foundry. High level circuit descriptions are impressed upon the logic gates or analog 
building blocks in the form of metal interconnects. 

Asserted 

(1) A verb indicating that a logic state has switched from the inactive to the active state. When 
active high logic is used it means that a signal has switched from a logic low level to a logic high 
level. (2) Assert: to cause a signal line to make a transition from its logically false (inactive) 
state to its logically true (active) state. Opposite of negated. 

Bit 

A single binary (base 2) digit. 
Bridge 

An interconnection system that allows data exchange between two or more buses. The buses 
may have similar or different electrical, mechanical and logical structures. 

Bus Interface 

An electronic circuit that drives or receives data or power from a bus. 
Bus Cycle 

The process whereby digital signals effect the transfer of data across a bus by means of an inter- 
locked sequence of control signals. 

BYTE 

A unit of data that is 8-bits wide. Also see: WORD, DWORD and QWORD. 
Data Organization 

The ordering of data during a transfer. Generally, 8-bit (byte) data can be stored with the most 
significant byte of a multi byte transfer at the higher or the lower address. These two methods 
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are generaHy called BIG END1AN and » 

D1AN refers to byte lane ordenng where the most ^"'"."j si „„ ifi cant byte is stored at the 
LITTLE ENDtAN refers to b>ne l» ortenng wh«e*e most stg». ^ J- q 

3!5 branny S£5 S In— fences Instiune, and was derived feonr the book 
nMiliv^r's Travels by Jonathan Swift. 

DWORD /?yrr WORD and QWORD. 

A unit of data that is 32-bits wide. Also see. wwicw ^ 



ENDIAN 

See the definition under 'Data Organization'. 



Firm Core conversion into an integrated circuit design, but 

file in the field of computer software design. 

Znynt for: F,e,d Prog— Gate A™, ^^^fflTiSl 

Granularity „o M Mp nf transferring. For example, a 32-bit port 

8-bits. Also see: port size and operand size. 

dtat is delivered in the fonn of a mask se, (i.e. a graphical description of the featnres 
and connections in an integrated circuit). 

Hardware Description Language (HDL) FxamDles include VHDL and Verilog®. (2) 

Ac™"™ for: Intellectual Property Core. Also see: soft core, fir. core and W core. 

level. Also see: asserted. 
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Operand Size 

The operand size is the largest single unit of data that is moved through an interface. For exam- 
ple, a 32-bit DWORD operand can be moved through an 16-bit port with two data transfers. 
Also see: granularity and port size. 

PCI 

Acronym for: Peripheral Component Interconnect. Generally used as an interconnection scheme 
(bus) between integrated circuits. It also exists as a board level interconnection known as Com- 
pact PCI (or cPCI). 

Port Size 

The width of a data port in bits. Also see: granularity and operand size. 
Posted Read and Write Cycles 

A method for minimizing data transfer latency between a data source and its destination. During 
a posted read or write cycle a bus interface, lying between a data source and its destination, cap- 
tures the data and completes any handshaking protocols with the data source. At the same time, 
the bus interface initiates handshaking with the data destination. This alleviates the need for the 
data source to wait until the destination is ready to accept the data. 

QWORD 

A unit of data that is 64-bits wide. Also see: BYTE, WORD and DWORD. 
Register (REG) 

A device capable of retaining information for control purposes that is contained in a single 
BYTE, WORD or DWORD of storage area. Said storage area is not general purpose memory. 
Also see: shared register (SREG). 

Shared Memory (SMEM) 

(1) The address space in a system which is accessible to all modules. (2) A type of memory that 
is shared between two or more ports. Shared memory uses a hardware arbitration scheme that 
allows simultaneous accesses from two or more processors. However, during simultaneous ac- 
cesses one processor may be required to wait until the another has completed its accesses into the 
shared memory area. Also see: shared register (SREG). 

Shared Register (SREG) 

(1) A register space in a system which is accessible to all modules. (2) A type of register that is 
shared between two or more ports. Shared registers uses a hardware arbitration scheme that al- 
lows simultaneous accesses from two or more processors. However, during simultaneous ac- 
cesses one processor may be required to wait until the another has completed its accesses into the 
shared memory area. Also see: register, shared memory (SMEM). 

SoC 

Acronym for System-on-Chip. Also see: System-on-Chip. 
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^PCere .ha. is deiivered in the fan. of a hardware description .anguage or schematic dia- 



gram. 



System-on-Chip (SoC) , integrated circuit chip. In many cases, 

Jhis^^^ 

finished system. 

Target Device ^u nrt , 0< nA onto which the IP core design is impressed. Typical 

The semiconductor type (or technology) onto wmcn 
examples include FPGA and ASIC target dev.ces. 

Zn'm for: VHSIC Hardware Description i^J^^^^S^ 
circuit]. A textual based computer WJ^^SU^^ 
guage is both a synthesis and a simulator tool Early to ^ and 

S. I^Z^^^^ « * - IEEE 1076 ' IEEE 10733 ' and 

IEEE 1164 specifications. 

Ic^^onyersaM^eE^ A lar m—r (hoard, hu, S—zed 
under IEC 821, IEEE 1014 and ANSI/VITA 1-1994. 

rsf— ch iP (soc,d^ 

main standard. 

Wrapper wtSHRONE IP Core into a WISHBONE compatible IP 

A circuit element that converts a ^"/> n " W f J®^ s prim i tive that is provided by an IC 

Core. For example, consider a 16-byte «^?^JSffiONE compatible SLAVE by layering 
vendor. The memory primitive can be ^f c ^™^ H BONE compatible SLAVE. A 

w ^ ^Z^^^^ Uare ^ in ' c t0 that wntten m 

'C++'. 

Zfof data tha. is 16-bi.s wide. Aiso see: and QWORD. 
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1.3 Recommended Skill Level 



It is recommended that the user have some experience with VHDL syntax and synthesis before 
attempting to integrate this core (or almost any other HDL core for that matter) into an FPGA or 
ASIC device. Most VHDL users report a fairly stiff learning curve on their first project, so it's 
better to have that experience before attempting to integrate the core. Prior experience with one 
or two medium size VHDL projects should be sufficient. On the other hand, some users may 
find the integration of the core a good way to learn many of the concepts in the VHDL language. 
Those users should find the integration experience rewarding. 
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2.0 System Architecture 



The VME64 to PCI Bridge connects a VMEbus slave interface to a PCI target interface. Com- 
munication between the two sides of the bridge is made through a set of four control registers, 
seven semaphore registers and nine shared memory buffers. Data is loaded into a buffer on one 
side of the bridge, and unloaded from the other side. The semaphore registers can also be used 
by system software to determine when and if a buffer is being used. The VMEbus side of the 
bridge also includes an EEPROM interface for programming Atmel AT 17 series devices. Table 
2-1 shows the address map from both sides of the bridge. 



Table 2-1. Address Map. 




VMEbus 


PCI 








BYTE Address 4 


BYTE Address 


Name 


Type 




(Base Offset) 


(BAR Offset) 






Tvoes 


0x00000 


0x0000 


DMC HW CONTROL 


SREG 


RAV 


0x00004 


(*) 


CONFIG PROM CMD 


REG 


RAV 


1 0x00008 


(*) 


CONFIG WRITE DATA 


REG 


R/W 


i oxooooc 


(*) 


CONFIG READ DATA 


REG 


R 


0x00010 


0x0010 


DMC CMD 


SREG 


RAV 


! 0x00014 


0x0014 


DMC FAULT 


SREG 


R/W 


i 0x00018 


0x0018 


DMC STATUS 


SREG 


RAV | 


0x000 1C 


0x00 1C 


SPEC1ALREG 


SREG 


R/W 


0x00020 


0x0020 


SEM BUF A 


SREG 


R/W 


0x00024 


0x0024 


SEM BUF B 


SREG 


R/W 


0x00028 


0x0028 


SEM BUF C 


SREG 


R/W 


0x0002C 


0x002C 


SEM BUF D 


SREG 


R/W 


0x00030 


0x0030 


SEM BUF E 


SREG 


RAV 


0x00034 


0x0034 


SEM BUF F 


SREG 


R/W 


0x00038 


0x0038 


SEM BUF G 


SREG 


R/W 


0x0003C - 0x007FF 


0x003C - 0x07FF 


Unused / Unreserved 






0x00800 -OxOOBFF 


0x0800 - OxOBFF 


BUF A 


SMEM 


R/W 


OxOOCOO - OxOOFFF 


OxCOO - OxOFFF 


BUF B 


SMEM 


R/W | 


0x01 000- 0x01 3FF 


0x1000 -0xl3FF 


BUF C 


SMEM 


R/W i 


0x0 1400- 0x0 17FF 


0x1400- 0x1 7FF 


BUF D 


SMEM 


R/W 


0x0 1800- 0x0 1BFF 


0x1 800- 0x1 BFF 


BUF E 


SMEM 


R/W 


OxOlCOO-OxOlFFF 


OxlCOO-OxlFFF 


BUF F 


SMEM 


R/W 


0x02000 - 0x023FF 


0x2000 - 0x23FF 


BUF G 


SMEM 


R/W 


0x02400 - 0x027FF 


0x2400 -0x27FF 


1 BUF H 


SMEM (**) 


R/W 


0x02800 -OxOFFFF 


0x2800 -OxFFFF 


! Unused / Unreserved 






0x10000 -0x7FFFF 




Unused / Unreserved 







Notes: 

Access types: 'R': read cycle, 'W: write cycle. 

Type: REG (register), SREG (shared register) and SMEM (shared memory) are defined in the glossary. 
(*) Not implemented on the PCI side of the bridge. 

(**) 'BUF H' is formed from Xilinx distributed RAM, and is the only region that will accept PCI burst transfers. All 
other buffers will only respond to single read and write cycles. 



4 By definition, VMEbus and PCI addresses are specified at byte locations. The first address given in the table is the 
byte address that should be used when accessing the register or memory area using DWORD (32-bit) values. 
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2.1 VMEbus Interface 

• u axtqt/vtta 1 1994 It has a D32:D16 data in- 
The VMEbus slave interface is compatible with AN T™ I? Kbyte boundaries. Stated another 
terfaee, and responds to A24 SLAVE ^^"f^Kb^Tocarions of 0x000000, 
way, the interface responds to A24 base a, base address OxOCOOOO, 

r SmcTmD be°ac e ceS':i OxiOOOOO + 0x0000,0 = OxiOOOiO. 

A „ registers and buffer memo,, -areas 1 tave ^port ^^KLIX'S 
(WORD) granularity, meaning that they can oe ac<^ 
cesses are not supported. 

The VMEbus interface is ^^^^^^^^^ 
Ss and DWORD (32-bit) accesses on 4-byte boundar.es . 

A1 , VMEbus accesses to "unused' address areas are terminated with the VMEbus [BERR*] * 



2.2 PCI Interface 

The PCI target interface is implemented with a Xi.inx LogiCORE PCI interfaced Unless other- 
SCp^ible wSh the PCI Version 2.2 bus spe.ficat.on. 

The target interface responds to the following PCI command cycles: 

. Configuration Read (CBE[3:0] = 1010) 

. Configuration Write (CBE[3:0] = 1011) 

. Memory Read (CBE[3:0] = 01 10) 

. Memory Write (CBE[3:0] = 01 11) 

. Memory Read Multiple (CBE[3:0] = 1 100) 

. Memory Read Line (CBE[3:0] = 1110) 

in the BAR0 register. 



^^^^ t A24 ^S^S^^^^^^ 2" Umited 

« This caveat is required because the target dev.cefo n £* Memory architecture (granularity) of 16-b.ts. 
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All registers and buffer memory areas located on the PCI target interface have port sizes of 32- 
bits (DWORD). All have 16-bit (WORD) granularity, meaning that they can be accessed as 16 
or 32-bit quantities. The bridge uses a 16/32-bit data interface, which means that it can read and 
write 16-bit WORD or 32-bit DWORD values. The interface does not support BYTE accesses. 

All PCI accesses to 'unused' address areas are terminated with a PCI TARGET ABORT. 

Special 'wrapper' circuitry connected to Xilinx LogiCore PCI interface controls how the target 
interface responds to the PCI commands. During the initial phase of a PCI burst cycle, a binary 
counter (located inside the wrapper circuitry) latches the starting PCI address. The counter then 
increments during subsequent phases within the burst cycle, thereby generating the next address. 
The counter is incremented after every cycle that accesses the high byte of a 32-bit DWORD 
transfer. The high byte is indicated when [S_CBE(3)] is low. That means that the address 
counter is incremented after every 32-bit transfer or after a high order 16-bit transfer. 

PCI single and burst transactions are supported by the bridge. However, burst transactions 8 are 
not supported by all memory locations or registers. If a burst transaction is attempted in a mem- 
ory region that does not support burst transfers, then the bridge responds with a TARGET 
ABORT termination 9 . 

Reading or writing to more than one 'SREG' or 'SMEM' area during a single PCI burst cycle 
may result in an unexpected behavior, and is not recommended. 

Also note that the 8-bit address counter rolls over from OxFF to 0x00 during burst cycles. This 
means that burst transfers cannot cross boundaries from one shared memory to another. For ex- 
ample, a 256 DWORD burst to the middle of 'BUF_H' will wrap around to the beginning of the 
same buffer. 

The Xilinx LogiCORE PCI handles all transactions with the bridge. The core is implemented as 
a PCI target, meaning that it cannot initiate transactions. This manual does not attempt to define 
the operation of the Xilinx LogiCORE PCI interface. 

The PCI core returns a PCI Device ID and a PCI Rev ID that are unique to the VMEbus to PCI 
Bridge core. These are returned from PCI configuration registers 0x00 and 0x08 respectively. 
The PCI Device ID always returns 0x030. The PCI Rev ID identifies the hardware revision level 
as shown in the 'Manual Revision' section at the front of this manual. 



8 Burst transactions are bus cycles with more than one data transfer phase. 

9 Burst transactions in excess of one data transfer are allowed as long as the memory structure supports it. This is 
because the core can be implemented on a number of target devices, memory types and speeds. The Xilinx Logi- 
Core PCI requires that memories must support single clock data transfers. If this type of memory is implemented on 
the device, then the burst operation is supported. If this type of memory is not implemented with the core, then burst 
transactions are not supported. For more information please refer the Hardware Reference section of this manual. 
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2.3 Register Descriptions 

Unl ess otherwise noted, the VMEbus and PC, shared registers (SREG, have identical hi, descrip- 



tions. 



2.3.1 DMC_HW_CONTROL Register 

Tables 2-2 and 2-3 show the operation of the DMC_HW_CONTROL Register. 

?^TlT)MC HW CONTROL Definitio7 




D01 
D02 



D03 



D04 



SysfailLocalDriver 
SysfailSystemMonitor 



AcfailSystemMonitor 



PCIResetMonitor 
Unused / Unreserved 



0 = Assert SYSFAIL (*) 
1_= Neg ate SYSFAIL 

0 



0 = SYSFAIL asserted 

1 = SYSFAIL negated 

0 = ACFAIL asserted 

1 = ACFAIL negated 



0 = No PCI reset 

1 = PCI r eset 
0~ 



D05-D31 

!SLs set unused bits to '0' to support future upgrades. 

(2) Son after VMEbus reset or device configuration denoted by. ( ). 
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Table 2-3. DMC HW CONTROL Detailed Description 


Bit# 


Detailed Description 


D00 
DMCReset 


WRITE: Clearing this bit asserts the [SPARTAN 1] output pin on the 
device. Setting this bit negates the [SPARTAN 1] output pin. [SPAR- 
TAN 1] is an active high signal. 

READ: Returns the current state of the WRITE bit. 


D01 

SysfailLocalDriver 


WRITE: Clearing this bit causes the local SYSFAIL* driver to assert 
the VMEbus SYSFAIL* signal. Setting this bit negates the local 
SYSFAIL driver. For more information about the operation of this bit, 
please see the SYSFAIL* Operation section of this manual. 

READ: Returns the current state of the WRITE bit. 


D02 

SysfailSystemMonitor 


WRITE: Always set to 4 0' to support future upgrades of this register. 

READ: Returns the current state of the VMEbus SYSFAIL* signal. 
When cleared, the VMEbus SYSFAIL* signal is asserted (i.e. failure 
mode). When negated, the VMEbus SYSFAIL signal is negated. 


D03 

AcfailSystemMonitor 


WRITE: Always set to '0' to support future upgrades of this register. 

READ: Returns the current state of the VMEbus ACFAIL* signal. 
When cleared, the VMEbus ACFAIL* signal is asserted (i.e. AC power 
failure mode). When negated, the VMEbus ACFAIL* signal is ne- 
gated. 


D04 

PCIResetMonitor 


WRITE: Always set to '0' to support future upgrades of this register. 

READ: Returns the current state of the PCI [RST#] signal. When set, 
the PCI bus is in its reset condition. When cleared, PCI bus is in its 
'run' condition. 


D05-D31 
Unused / Unreserved 


WRITE: Always set to 6 0' to support future upgrades of this register. 
READ: Always returns '0'. 



The VME64 to PCI Bridge supports a standard implementation of the SYSFAIL* line. This in- 
cludes both its start-up diagnostic and run-time failure capabilities. While these capabilities are 
not defined by the VMEbus specification, they do conform to standard industry practices. 

SYSFAIL* is an open-collector class VMEbus signal. That means it operates as a 'wire-nor' 
circuit that is asserted if one or more VMEbus modules drives it low. It is negated only after all 
VMEbus modules negate their on-board SYSFAIL* drivers. 
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The standard start-up diagnostic operation of YSFA^ - t^S^^^ 
practice, SYSFAIL* is asserted by all, "^"3^^ are operating correctly. As 
After SYSRESET* is negated, all modul f s are ^ted to see it th y P oniy ^ ^ 

each module passes its test it negates ^^J^s a standard industry practice where 
modules pass these diagnostic ^^«J~^^d to be 'bad' until proven 'good'). 

SYSRESET* \ ■ ' self test performed 

^SYSML* asserted > 

■ / by on all modules L 
SY smi* \ after SYSRESET*. ' 




SYSFAIL* negated 
after self test 
passes on all modules . 

19 SEP 2002 



EigUS 7.1 . Stan '--- 1 ""P 1 ™ SYSFMI * 

Very often a redden LED is attached ,0 a ^^^^J^Z^ 
diagnosis ofthe systen, atb octuple y an » P ^ £ ^ In «. 

simply asserts SYSFAIL*. 

The VME64 ,0 PC, Brid g e has ^^^^X^S 
SYSFAIL* operation. Control and status bits are avanaoie 

VMEbus register. 

The SYSFAIL* control and status bits are 

(ESTOP) button. ESTOP buttons are a common p ™ in the chai n causes the 

ally arranged into a 'wire-or' chain ^hereby PJ^g ^ ^""Sard technique in all sys- 
system to shut down in a known and controllab^ EST OP is a large red 

^ ^^XZ^TZ hazardou/conditions (such as de- 
energizing high voltage power supplies). 

A common practice in ^-rTS^^^ 
TOP chain. 
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The VME64 to PCI Bridge provides this capability on both sides of the bridge. Furthermore, 
each side of the bridge can monitor the other side of the bridge. For more information, please 
refer to the register descriptions. 

2.3.2 Configuration EEPROM Registers 

The Configuration EEPROM Interface assists in programming of the Atmel ATI 7 Series of 
FPGA Configuration EEPROM. The interface is configured by three registers: 

o CONF1GPROMCMD: Command/status register 
o CONFIGWRITEDATA: Write data register (8-bits). 
o CONFI G_READ_DATA : Read data register (8-bits). 

The user is encouraged to study the following data sheets (provided by Atmel) before program- 
ming a device: 

1) Programming Specification for Atmel 's ATI 7 and AT 17 A Series FPGA Configuration 
EEPROMS. Atmel Corporation 2002. See www.atmel.com. 

2) AT17LV040 FPGA Configuration EEPROM Memory (data sheet). 

The CONFI G_PROM_CMD register is shown in Tables 2-4 and 2-5. The bits in this register 
control the various operations of the interface. The CONFIG_READ_DATA and CON- 
FI G_WRITE_D ATA registers are for reading and writing data to the EEPROM. Load the lower 
eight bits of the CONFIG WRITE DATA register before issuing a write command, and read the 
lower eight bits of the CONFI G_READ_D AT A register after completion of a read command. 

The EEPROM interface can be reset (initialized) in one of two ways: (1) under hardware control 
in response to a system reset (via the WISHBONE reset line [RST I]) or (2) under software con- 
trol in response to the 'ResetPromlnterface' bit in the CONFI G_PROM_CMD register. 

After a system reset the 'ResetPromlnterface' bit is set by hardware. This causes all sections of 
the EEPROM interface to be initialized. The reset operation may take some time to complete, as 
the interface operates with a clock speed of about 400 KHz. This means that the reset interval 
can last in excess of 3 - 6 microseconds. During this time all processor inquires to the 'Reset- 
Promlnterface' will return a 'BUSY' state. While busy, the host processor should not attempt to 
initiate EEPROM accesses. 

To reset the EEPROM interface, the VMEbus host processor should perform the following: 

1) Reset the interface by setting the 'ResetPromlnterface' bit (D00) in the CON- 
FIG_PROM_CMD register. 

2) Monitor bit D00 ('BUSY') until the bit is cleared. 

3) Perform other operations as needed. 
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Bit# 
DOO 



DOS 



D06 



D07 



D08 



a roNFTO PROM CJMD_ 



Name 
ResetPromlnterface 

SendStartCondition 

SendStopCondition 

WriteByteWithAck 

ReadByteWithAck 



ReadByteWithStop 



AckStat 



ConfSerEn 



ManualConfigOverride 
Manual Override) 
ConfSerClkOverride 
(Manual Override) 
ConfSerDatOverride 
(Manual Override) 



ConfSerDat 
Manual Override) 
Unused / Unreserved 



Write 

0 = Operate interface (*) 

1 = Reset interface 
0=No operation (*) 

1 = Send start condition 
0=No operation (*) 
1 = Send stop condition 

0 = No operation (*) 

1 = Write data byte 
0 =No operation (*) 
1 = Read data byte 

0 = No operation (*) 

1 = Read data byte 



Read 
0=Not busy 
1 = Busy 



0 = Not busy 

1 = Bus) 

0 = Not busy 

1 = Busy 



0 

0 = Disable configuration (*) 

1 = Enable configuration 

0 = Normal operation (*) 

1 = Manual configuration 

0 = ConfSerClk cleared (*) 

1 = ConfSerClk set 

0 = ConfSerDat cleared (*) 

1 = ConfSerDat set 
0 



0 = Not busy 

1 = Busy 



0 = Not busy 

1 = Bus: 

0 = Not busy 

1 = Busy 



0 = ACK received 

1 =No ACK received 
Readback 

Readback 



Readback 
Readback 
Input State 



0 
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Table 2-5. CONFIG PROM CMD Detailed Description 


Bit# 


Detailed Description 


D00 

ResetPromlnterface 


WRITE: Setting this bit resets the EEPROM interface. Negating the bit 
has no effect. This bit need only be set once to initiate a reset. 

READ: Returns the current state of the reset that was initiated by as- 
serting the WRITE bit. The bit returns T while the interface is busy 
implementing the reset operation, and '0' when it has completed. 

NOTE: a 'ResetPromlnterface' command is initiated in response to a 
system reset or device configuration. That means that host processor 
could read this bit as a ' V shortly after a system reset. However, the bit 
will be eventually be negated when the EEPROM interface has com- 
pleted its reset cycle. 


D01 

SendStartCondition 


WRITE: Setting this bit initiates an EEPROM START CONDITION. 
Negating the bit has no effect. 

READ: Returns the current state of the START CONDITION which 
was initiated by asserting the WRITE bit. The bit returns 5 1 5 while the 
interface is busy implementing the START CONDITION, and 4 0' 
when it has completed. 


D02 

SendStopCondition 


WRITE: Setting this bit initiates an EEPROM STOP CONDITION. 
Negating the bit has no effect. 

READ: Returns the current state of the STOP CONDITION which was 
initiated by asserting the WRITE bit. The bit returns 6 1 5 while the in- 
terface is busy implementing the STOP CONDITION, and '0' when it 
has completed. 


D03 

WriteByteWithAck 


WRITE: Setting this bit writes a byte of data to the EEPROM. Negat- 
ing the bit has no effect. Before setting this bit the data must be loaded 
into lower eight bits of the CONFIG_WRITE_DATA register. 

READ: Returns the current state of the write operation which was initi- 
ated by asserting the WRITE bit. The bit returns ' 1 ' while the interface 
is busy writing data, and '0' when it has completed. The state of the 
'ACK' bit (returned by the EEPROM) is indicated by bit D06 (below). 
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Table 2-5. < 


PONFIG^ROM^CMD Detailed DescnBtigjiicont) 

Detailed Description _ - 


Bit# 

D04 t 

i 

t-» in. ,4- 7 1 4-1-1 A r*V 

ReadByte w ltnACK 


ellL nht ACK CONDITION. Negating the bit has no ef- 
feet. 

is busy writing data, and '0' when it has completed. 

Note: After completion of the 'ReadByteWithAck' command *e 
EEPROM data byte is returned in the lower eight bits of the CON 
Fir, READ DATA register. iTTrrPPOM nnrt 


D05 

ReadByte w unstop 


- ' 1TF . c e ninglh¥biniads a byte of data from the LhFRUM ana 
WRITE. Sett in 8 m,s ™ ' e COND ir 10 N. Negating the bit has no ef- 
terminates it with a STOP CONDI iwn- g * 
feet. The data is returned in the CONFlG_KfcAU_uA a & 

is busy writing data, and '0' when it has completed. 

Note- After completion of the '^^^^'^^1^. 
EEPROM data byte is returned in the lower eight bits of the CON-j 

FIG READ DATA register. _ 


D06 
AckStat 


" WRITE: Always write a '0' to this location. 
READ- Returns the current state of the ACK bit after implementing the 

to QCicrrniiic n ^ 


D07 

J ConfSerEn 


to read or write to the EEPROM. Always negate it when completed. 
PF AD- Returns the state of the WRITE bit. 
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Table 2-5. CONFIG PROM CMD Detailed Description (con't) 


Bit # 


Detailed Description 


D08 

ManualConfigOverride 


WRITE: Clearing this bit allows the interface to operate normally. Set- 
ting this bit overrides all other functions and allows the ConfSerClk 
and ConfSerDat to be operated manually. This function is used for test 
and debugging purposes. 

READ: Returns the state of the WRITE bit. 


D09 

ConfSerClkOverride 


WRITE: asserts or negates the ConfSerClk signal when the Manual- 
ConfigOverride bit is set. 

READ: Returns the state of the WRITE bit. 


D10 

ConfSerDatOverride 


WRITE: asserts or negates the ConfSerDat signal when the Manual- 
ConfigOverride bit is set. 

READ: Returns the state of the ConfSerDat signal. 


Dll 

ConfSerDat Input 


WRITE: Always write a '0' to this location. 
READ: Returns the state of the ConfSerDat input. 


D12-D31 
Unused / Unreserved 


WRITE: Always write a '0' to this location. 
READ: Always returns '0\ 



When programming a configuration EEPROM the user is directed to the programming algorithm 
in the Atmel Programming specification. The interface provides much of the timing. For exam- 
ple, Figure 2-2 shows how the interface implements the first few bytes of data shown in the 
"Write to Whole Device" algorithm. 

The Atmel ATI 7 series parts must be programmed at a speed that is sufficiently high to guaran- 
tee the 'Write Cycle Time' (Twr) indicated in the data sheet. Stated another way, data written to 
the device must be delivered within a minimum length of time. For example, the Write Cycle 
Time of the Atmel AT17LV040 is specified as 25.0 ms (max). This means that the theoretical, 
maximum time required to deliver the data to the EEPROM is: 

256 data bytes/page x 9 bits/byte 10 = 2,304 data bits/page 

4 address bytes/page x 9 bits/byte = 36 address bits/page 

2304 data bits/page + 36 address bits/page = 2,340 bits/page 



An address or data byte includes eight bits of information plus one acknowledge bit, or 9 bits total. 
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The EEPROM interface" operates at an eloek speed (ECLK) ^J^^^^Z 
data bit requires 4 cloek eycles to complete. This means that the minimum, total Wnte Cycle 

TimC 1S: 2,340 bits/page x 4/393 KHz = 23.8 ms 

very' small guard band (1.2 ms) to meet the specification. 

way, a new ' WriteByteWithAck' command must be issued immediately after the BUSY bit has 
been negated from a previous command. 

rw th* FFPROM interface hardware has written a data byte to the device with the 'Write- 
B negates J. ^^^ T ^^XZ 

Cw^SS 1£& pTe""e A J^ t he mterface win. prob. 
KfJSC However, if [Tav] is exceeded for too many cycles, then .here ,s a nsk that 
maximum Write Cycle Time of 25.0 ms could be exceeded. 

The maximum time available [Tav] from 'BUSY' negated to 'WriteByteWithAck' set is found 
from the relation: 

Tnv = ! - ECLKsetup 

ECLK CLKJ 

Given an EEPROM clock [ECLK] of 393 KHz, a WISHBONE clock speed [CLKJ] of 33.0 
MHz and [ECLKsetup] of 1 WISHBONE clock yields a [Tav] of: 



Tm ,_ 1 ._J? 1 — = 2.34 uS 

393KHz 33MHz 33MHz 



Important Notice: 

The first byte on the AT17 series EEPROM cannot be reliably read by the FPGA unless 
power is eycled after programming. For more informat.on, please refer 
to the programming instructions. 



TT^— i^^o^EPROM interface please refer to the CEEPROM. VHD' hardware reference 
located elsewhere in this manual. 
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Load Device Hdress 

|K\b or mi 
hxa OXnGJSITEJKS. 

tteiteEytgaithSck • 1 




Load US of EEFKM address 
into CQNFIGB(ITE_DHA 

ftitaBTtcsethlld * 1 




17 AUG 2002 

Figure 2-2. High level software control algorithm for the EEPROM Interface. 
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2.3.5 DMC_CMD Register 

DMC CMD is a 32-bit, user defined, general purpose read/write register. 



2.3.6 DMC_FAULT Register 

DMC.FAULT is a 32-bit read/write register, with the bit definitions shown in Table 2-6 

t.kio ->jk nMC FAULT Definition 

Write 



Bit# 
D00 

D01-D31 



Name 
WatchdogFaultDSP 

User defined 



0 

Latched 



Read 
0' = No timeout 
T = Timeout 
Returns current state 



2.3.7 DMC_STATUS Register 

DMC.STATUS is a 32-bit, user defined, general purpose read/write register. 

2.3.8 SPECIALREG Register 

SPEC1ALREG is a 32-bit, user defined, general purpose read/write register. 



2.3.9 PCI_SEM_BUF_(A-G) 

CC w r>i yp ( A CA are used by VMEbus or PCI system proces- 
The seven semaphore registers SEM_BUF_(A G) ar 2 . 7 is represe ntative of the 

— — • • — 

BUF(A-G) shared buffer memory. 



Table 2-7. SEM BUF (A-G) Definition 



Bit# 
D00 



Read 
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The 'SemaphoreBufferN' bit indicates whether a VMEbus or PCI system processor has obtained 
the memory space. The semaphore does not lock the memory buffer but just provides a mecha- 
nism for software to determine if the particular memory buffer is being used. If the semaphore is 
not used, or is disregarded by software, the associated buffer arbitration still operates normally. 

The semaphore is accessed by reading the bit. If the bit is returned as '0', then the processor has 
obtained ownership of the buffer. If the bit is returned as T, the buffer is busy. If a processor 
obtains the buffer by reading '0' (becomes the owner), and the bit is sampled for a second time, 
then the bit is returned as 6 V on the second access. 

If a VMEbus and PCI semaphore access occurs at the same time (i.e. during the same clock cy- 
cle), then the VMEbus access has priority. 

The buffer is released by writing a '0' to the semaphore. Writing a T to the bit does not have 
any effect. The buffer may be released from either the VMEbus or PCI side of the bridge. 



2.4 Operation of Shared Memory Buffers and Registers 

The VME64 to PCI Bridge contains several different types of memories and buffers. These are 
classified as SMEM (shared memory), REG (register) and SREG (shared register). The classifi- 
cation for any particular memory or register can be found in the VMEbus and PCI address maps 
located elsewhere in this manual. 

SMEM and SREG types are shared by both sides of the bridge 12 . Each has its own arbiter circuit 
that resolves any contention between the two sides of the shared resource. For the most part, this 
operation is transparent to the software programmer. However, in some cases it is important to 
understand how the arbitration works. 



2.4.1 Shared Buffers 

There are eight shared memory buffers named 'BUF_A' through 'BUFH'. These can be used 
to pass data between the two sides of the bridge. They operate as 32-bit wide memories with 16- 
bit granularity. This means that information can be passed in WORD (16-bit) and DWORD (32- 
bit) data formats. 

Each buffer operates as an independent shared memory. This means that both sides of the bridge 
supports full, simultaneous, read/write privileges into each buffer. This provides the software 
designer with a very flexible mechanism for moving data, as well as an excellent way to test 
memory from both sides of the bridge. 

There may be some undesirable side effects as a result of contention within the shared areas. 
Specifically, the interface may slow down because a hardware arbiter must grant accesses to one 



12 A third type called 'REG' (register) is a non-shared register. These operate independently from the opposite side 
of the bridge. 
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*. of the bridge or the othen To *via* dtis = ^ = « 
SCtS " £. - Sf = - - —us memory co, 
S so neither side of the bridge ever waits for an aeeess. 

As an option, the software ^"*^.',SSSr 

can be used to determine which stde of the bridge owns aparac deter mine if 

S^Ki: SpnT ^ofused, or is disregarded by 

S>e locM buffer arbitration still operates normaily. 

The f,rs, seven m W ou#- have an ass^ — ^ fjSou^lt 
^ d ^X S SfflSS^S ..pes are identic.. An eighth 
Se tT»V Z no. havt: sentaphore register associated with ... 



2.4.2 Hardware Arbitration 



AccesS es ^ both the VMEbus and PCI side of the 

contested. When simultaneous accesses take ^p ^ ce l ^ sorthe p Cl sideofthe bridge. This 
hardware arbiter holds off the access from SMEM or SREG contains an 

means that only one access can take place at any given time, t* 



identical arbiter. 



« hnth the VMEbus and PCI sides of the bridge (over a single 
If a simultaneous access occurs from both tte VMQ« ana $ide Qf the bridge; 

clock period), then the hardware arbiter will grant the access 



and hold off the PCI side 



2.4.3 PCI Accesses 



Each register or buffer arbiter assigns £ tt" 

side of the bridge wins the arbitration, then . holds Ok re our ^ ex _ 

da. This means that arbitration only takes pfcee one a, the tag- « ^ 

ample, if the PCI side of the bridge does a buns transte. Mo m remains 

automatically relinquished 13 . 



see the 'Memory Requirements' section of th.s manual. 
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2,4.4 VMEbus Accesses 



The VMEbus side of the bridge works somewhat differently. That side of the bridge supports 
posted read and write operations. During a posted read or write operation the VMEbus interface 
captures the data and completes any handshaking protocols with the data source (e.g. a memory 
buffer). At the same time it initiates handshaking with the data destination. This alleviates the 
need for the data source to wait until the destination is ready to accept the data. 

During a VMEbus posted write operation to an SREG or SMEM region, the bridge does three 
things: (1) it captures the VMEbus data, (2) it asserts the DTACK* signal to begin termination of 
the VMEbus cycle and (3) it begins writing the data to its destination. The data is held in the 
posted write latch 14 until the destination arbiter is ready to accept it. When ready, the interface 
completes the data transfer into the SREG or SMEM region. 

Once the VMEbus posted write latch has captured the data, it continues to hold it until the desti- 
nation is ready to accept it. In SREG regions the data is delivered within a few clock cycles be- 
cause the PCI side of the bridge is limited to single transfer cycles. However, in SMEM regions 
it could take some time for the data to be delivered to its destination. For example, if a 33.333 
MHz PCI interface were to transfer 512 words of data during a burst transfer, then the delay be- 
fore data is accepted is at least 15 uS: 

512 WORDS/BURST x 1/(33.333 MHz x WORD) = 15 uS / BURST 

Once the VMEbus posted write latch has captured the data, it prevents the interface from re- 
sponding to further VMEbus cycles until the data has been transferred to its destination. This 
prevents subsequent VMEbus cycles from overwriting the posted write data latch. The VMEbus 
cycle will be accepted only after the posted write data has been transferred from the latch. 

Under these circumstances the waiting period could be long enough to trigger some VMEbus 
BERR* watchdog timers (depending upon how they are configured). However, if software uses 
the semaphore registers (SEM_BUF_A, etc.) to determine ownership of a buffer, then data will 
always be delivered within a few clock cycles. 

The VMEbus posted read cycles operate in a similar, reverse manner. During read cycles from 
an SREG or SMEM region the VMEbus interface waits for the arbiter to grant the source to the 
interface. During this interval the VMEbus MASTER participating in the cycle waits until the 
data is ready. Once the arbiter has granted ownership of the data source to the VMEbus interface 
it does three things: (1) it captures the read data in the posted read latch, (2) it completes hand- 
shaking with the data source and (3) it asserts the VMEbus DTACK* signal to begin termination 
of the cycle. The data is held in the posted read latch until the VMEbus cycle is completed. 

The posted read capability prevents the VMEbus side of the bridge from 'hogging' the SREG or 
SMEM area. Once arbitrated, the source data is delivered to the posted read latch in a few clock 
cycles. Once the data is delivered, the arbiter can service any pending accesses on the PCI side 
of the bridge. 



14 The VMEbus posted write latch is physically located in the VPWWRAP hardware section. 



29 



2.5 Reset Operation 

T.e VME64 to PCI Bridge SoC responds ro bo* £«tS^S 
SSS-S iJW* 35S^=S US, response . *e PC, ^ 
signal. 
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3o0 Hardware Reference 



Most of the VME64 to PCI Bridge SoC was created and delivered in the VHDL hardware de- 
scription language. VHDL source code must be synthesized before operation on a particular tar- 
get device (such as an FPGA or ASIC). A variety of simulation, synthesis and CASE 15 tools can 
be used with the core. 

Most of the components used by the core are provided with the source code. However, there are 
a few exceptions. RAM and I/O drivers must be synthesized with entities provided by the FPGA 
or ASIC vendor. That's because portable, synthesizable RAM and ROM elements are not sup- 
ported by the VHDL standards. Furthermore, the SoC uses a PCI target interface that was made 
from a Xilinx LogiCORE PCI element. Xilinx does not provide source code for this core, but 
rather a 'firm core' in the form of a proprietary Xilinx \ngo' file. This file is combined with the 
rest of the SoC at route time. 

With the exceptions noted above, the VME64 to PCI Bridge is provided as a 'soft core'. This 
means that all VHDL source code and test benches are provided with the design. This enables 
the user to see inside of the design, thereby allowing a better understanding of it. This is useful 
from both a design and test standpoint. From a design standpoint the user can tweak the source 
code to better fit the application. From a test standpoint, it allows the user to create custom test 
benches that incorporate both the core and other entities on the same IC. 

The soft core approach allows the VME64 to PCI Bridge to be synthesized and tested with a va- 
riety of software tools. This reduces the cost of special VHDL development software. Users 
should verify that their software tools conform to the IEEE standards listed in the next section of 
this manual. 



3.1 VHDL Simulation and Sym thesis Tools 

It is assumed by Silicore Corporation that all simulation and synthesis tools conform to the fol- 
lowing standards 16 : 

o IEEE Standard VHDL Language Reference Manual, IEEE STD 1076-1993. 
o IEEE Standard VHDL Synthesis Packages, IEEE STD 1073.3-1997. 
o IEEE Standard Multivalue Logic System for VHDL Model Interoperability, IEEE STD 
1164-1993. 

In most cases the VHDL source code should be fully portable as long as the simulation and syn- 
thesis tools conform to these standards. However, if incompatibilities between the source code 
and the user's tools are found, please contact Silicore Corporation so that the problem can be re- 
solved. 



15 CASE: Computer Aided Software Environment 

16 Copies of the standards can be obtained from: IEEE Service Center, 445 Hoes Lane, P.O. Box 1331, Piscataway, 
NJ USA 08855 (800) 678-4333 or from: www.ieee.org 



31 



a.a .hat the user have a set of VHDL simulation tools before integrating 
It is strongly recommended that the .user lave a set confidence that the core syn- 
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'^— ^ " ^ dCTiM - ^ f °"° Wing " 
in the source disk examples: 

. Xilinx integrated Software Environment (ISE) 4.2 (FPGA Router) 
• Xilinx XST Tools (VHDL Synthesis) 
. ModelSim 5.5b (VHDL Simulator) 

AD origina, VHDL source files have been ^^"buS m^hatS 



3.2 VHDL Portability 

^ v«r!«Mwi tend to synthesize unusual logic in some VHDL 
. No variable types are used. Vara* les/ tend tc >symn< es For example, all count- 

rsai^* variable 

types are used in the test benches, however. 

three-state buses well, SO me VHDL synthesis tools will auto- 

S^^bt^on .r—ors. This perfecrly acceptable if 
the target device supports them. 

cation 
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resets and presets very well. The asynchronous resets and presets are less portable, but 
are still supported by most devices. Asynchronous presets are least portable, and have 
been eliminated from the design. 

o Asynchronous, unintended latches have been eliminated from the design. These are 
usually the result of incompletely specified if-then-elsif VHDL statements. 

o Each source file contains one entity/architecture pair.. Some simulator and synthesis 
tools cannot handle more than one entity/architecture pair per file. 



3.3 Required Resources on the Target Device 

The logic resources required by the VME64 To PCI Bridge are fairly common, and are available 
on many FPGA and ASIC target devices. However, before synthesis the user should confirm 
that the following elements are available on the target device: 

o At least two global, low skew clock interconnects for [PCLK] and [ECLK]. Most of 
the logic in the core is synchronous, and the global clocks coordinates all of the internal 
activity. 

o Logic elements such as NAND gates, NOR gates, inverters and D-type flip-flops. Only 
elements defined by the IEEE STD 1 164-1993 standard are used in the core. 

o D-type flip-flops with known power-up conditions. The VME64 To PCI Bridge has 
some internal bits that must be set to pre-defined states after a power-up or configura- 
tion reset. 

o Fourteen 256 x 16-bit block RAMs, and two 256 x 16-bit distributed RAMs. These are 
used for shared memory buffers. In order to support PCI burst transfers the RAM ele- 
ments must be capable of operating within a single clock cycle. For more information 
see the section below regarding memory integration. 



3.3.1 Clock Requirements 

Three clocks are required by the VME64 to PCI Bridge. These are called [PCLK], [VCLK] and 
[ECLK]. [PCLK] stands for PCI CLocK, and is used by the Xilinx LogiCORE PCI IP core. 
[VCLK] stands for Vme CLocK, and is used by the Silicore VMEcore(tm) element. [ECLK] 
stands for EEPROM clock, and generates the clock for the EEPROM hardware interface. 
[ECLK] is formed from [PCLK]. Under certain conditions [PCLK] and [VCLK] may be tied 
together to form a single clock interconnection. This is done on the VMEbus to PCI Bridge 
Core. Table 3-1 shows the requirements for the two clocks. 
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Table 3-1. A 



lnwable Frequenc 



Clock 
PCLK 
VCLK 



Fmin 


Fmax 


_0 


33.333 MHz 


" 33.333 MHz 


50.000 MHz 



Duty Cycle 
40/60 - 60/40 
40/60 - 60/40 



PCI Rev 2.2 

vita 1.1QQ4& VMEcore™ 



The [PCLK] Cocking constraints foUow those *£^3£E£ 

XiUnx LogiCORE PC. has severa, c— , - , ock ,oop (PLU Fur- 

19 

ponents . 

The [VCLK] clocking constrain* 

samples an asynchronous bus, and must^ be d b the need t0 sa mple the asyn- 
Figure 3-1. The 33.333 MHz minimum frequenc y isdi > ft can > t g0 above 

ch § ronous VMEbus signals fast ^^^^X^ strobe skew on the back- 
50.000 MHz because the period of [VCLK] musi not 

plane. 

lf [PCLK] ana [VCLK, operate a, (or neaO « ^tT««: 



» The PCI specification allows the PCI clock to vary as long as .t rema.ns 
^xlSrscountenneasures for these problems on their web site. 



monotonic and within the duty cycle 
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DSA* 



DSB* 



VCLK(min) /© ~\ 



\ 



30 nS. (min) 



\ 



f 



(a) AT VCLK(min) (33.333 MHz) AT LEAST ONE ASSERTED SAMPLE IS ASSURED ON BOTH DATA STROBES . 



DSA* 



DSB* 



VCLK(max) 



20 nS (max) 



f 



(b) AT VCLK(max) (50.000 MHz) AT LEAST ONE ASSERTED SAMPLE IS ASSURED AT MAXIMUM BUS SKEW. 

JAN 24, 2002 

Figure 3-1 . VMEbus requirements (constraints). 
3.3.2 Memory Requirements 

A wide variety of RAM memory elements can be used with the design. However, some types 
will operate faster and more efficiently than others. In general, if the memory interface closely 
resembles that needed by the IP cores in the design, then everything will run fast. If the memory 
is significantly different, then everything will slow down. 

The internal architecture of the VME64 to PCI Bridge assumes that all memories conform to 
something called 'FASM', or the FPGA and ASIC Subset Model 20 . That's because the Xilinx 
LogiCORE PCI, the Silicore VMEcore and the WISHBONE interconnection all rely on the same 
type of FASM synchronous RAM elements. 

The FASM synchronous RAM model conforms to the generic connection and timing diagram 
shown in Figure 3-2. During write cycles, FASM synchronous RAM stores input data at the in- 
dicated address whenever: (a) the write enable (WE) input is asserted, and (b) there is a rising 
clock edge. 



20 The original FASM model actually encompasses many type of devices, but here the focus will be on the FASM 
synchronous RAM models. 
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/ 



WE 



FEB 2001 
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immediately after the rising clock edge. 

XHe basic advantage behind the ^^S;* ST^ 
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tling the data transfers. 

The Silicone VMEcore and W.SHBONE interconnection systems both support single clock data 
transfers and data throttling. 
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dttces do support at leas, one style of FASM memory. 

Sine a Xi.in* ,P Core product is useuMn 

the SoC must support the Xilmx Block SelectKAM 
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conform to the FASM guidelines, and will not work in single clock systems. They require an 
extra clock cycle either at the front of the cycle (to register the address) or at the back end of the 
cycle (to register the data). Although these can be adapted to operate in a single clock configura- 
tion (see Figure 3-3), this option was omitted in the design because the Xilinx LogiCORE PCI 
would not route to these memories at 33 MHz on a Xilinx Spartan 2. 



DOUT 



DIN 
ADR 

WE XILINX 
SPARTAN2 
EN BLOCK 
RST MM 



CLK 



N CLK 



DIN() 
DOUT() VALItD 
ADR() 



WE 



EN 



RST 



WISHBONE 
CYCLE 



VALID 



WISHBONE 
^ CYCLE ^ 



^1 Tds 



VALID 



Tas 



VALID 



® 




j 



Tas 



READ CYCLE 



WRITE CYCLE 



FEB 2001 

Figure 3-3. Xilinx LogiCORE PCI block RAMs when configured for single cycle operation. 



The Xilinx Spartan 2 distributed RAM can support single clock bus cycles. Their main disad- 
vantages are that they contain fewer memory bits than the Xilinx Block SelectRAM+ elements, 
and consume many logic LUTs (look up tables). 
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The interna, memory anc 

either single or double clock memory cycles^ Thi in i*> y ^ d on 

for one or the other. If single clock ^"^^^to burst transfers are not sup- 
the PCI target interface. If in a single clock configuration (as de- 

temal circuit is effectively doubled from 33 to 66 Mhz. 

Tabie 3-2 shows the Xihnx RAM rypes used for the buffer memories in the VMEbus to PC, 
bridge. 
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4.0 VHDL Entity Reference 

The VME64 To PCI Bridge is organized as a series of VHDL entities. These are tied together 
into an entity called VMEPCIBR_SOC. All of the higher level (first and second tier) entities are 
described in this chapter in alphabetical order. For more information about the SoC hierarchy 
please refer to the VMEPCIBR SOC entity description below. 
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4.1 CEEPROM Entity 

S^SSffl- I Zt^XZ* a WISHBONE conrparible dara bus tnterfaces 
with characteristics shown in Table 4-1. 



> H Conf SerEn 



ConfSerDat 



> Conf SerClk 




CEEPMD: HIGH LEVEL ^J™^ 



I, n Bjsoicdi f "" configuatjon FFPROM Inlerface . 

Da ,a .—ions over .he da* interface are ^11^™^^ 
The entity also includes a dock fend. -eg* » J-^™, [ CL KJ] is operated a, 
[ECLK 0]. The nominal frequency of [ECLKJUJ b the VHDL entity. 

33 T ^ tnCLK -dL^Vfre^ey of [CLKJ, isa, 

Any frequency of [CLKJ J anojtu-N j 
least four times the frequency of [bLLR_uj. 

EEPROM control clock IBC^^ 

Sfi E SSSI&^BS a 8 ,oha, Cock net huffer to reside a, ,he 

highest system level for convenience. 
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Table 4-1. WISHBONE DATASHEET for the CEEPROM Entity 


General Description 


SLAVE interface for the Atmel AT 17 series 
of FPGA configuration EEPROM. 


WISHBONE Revision Level 


B.2 


Supported WISHBONE Cycles 


SLAVE: READ/WRITE 


Data port, size 


32-bit 


Data port, granularity 


16-bit 


Data port, maximum operand size 


32-bit 


Data transfer ordering 


BIG ENDIAN or LITTLE ENDIAN 


Data transfer sequencing 


None 


Signal Description 


All WISHBONE signal names are identical to 
those defined in the specification. 


Terminating Signals 


The WISHBONE interface on both ports sup- 
port only the [ACK_0] terminating signal. 
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The [EEPENA] signal is asserted whenever the 'ConflSerEn' bit (D07) is set in the CON- 
FIG_PROM_CMD register. This indicates that the EEPROM is enabled for configuration. It 
also indicates that the three-state clock and data pin drivers on the target device are enabled. Ex- 
ternal hardware (other than the EEPROM itself) must refrain from driving these signals when 
[EEPENA] is asserted. 

The WISHBONE interface includes all registers that are accessible through the interface. They 
operate at the nominal WISHBONE clock speed of 33 MHz. The WISHBONE interface also 
includes a clock domain transformation circuit as shown in Figure 4-3(a). Figure 4-3(b) de- 
scribes the relationship between the two clock domains. The clock transformation circuit allows 
two clock domains to communicate seamlessly. Figure 4-4 shows a timing diagram for the cir- 
cuit. 

The EEPROM state machine controls all low frequency activity for the EEPROM interface, with 
its state diagram shown in Figure 4-5. The state machine forms an instruction set with six in- 
structions. Five of these instructions correspond to the bits in the CONFIG_PROM_CMD regis- 
ter described elsewhere in this manual. A sixth instruction (WAIT) is a type of NOP (no opera- 
tion). When an instruction is requested (e.g. SendStartCondition), the state machine generates 
four 'RISC type' instructions. These, in combination with some control bits, handshakes with 
the EEPROM. A loop instruction is formed with a the BITCOUNTER process. Figures 4-6 
and 4-7 shows the high-level timing for the EEPROM. 
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PROCESS: EEPC REGISTER 
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PROCESS: SYNC COMMAND 
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STATES: MSI, MSO 
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EQUATIONS: 

MSO /MSI * /MSO * WCSDONE; 

MSI = /MSI* MSO * WCSDONE 
+ MSI * /MSO * WCSDONE; 
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11) UNUSED 



STATE DIAGRAM 

Fi gure 4-3(aY C jock domain Information circuit 



CEEPROM.VHD: HANDSHAKING CIRCUIT 
20 AUG 2002 
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NOTE: FOR PROPER OPERATION OF EACH BIT, THE FREQUENCY OF 'CLKJ 1 
MUST BE AT LEAST FOUR TIMES THE FREQUENCY OF 'ECLK 1 . 

BECAUSE: 

1 CLK_I FLIP-FLOP. SET-UP 

1 CLK I SYNCHRONIZATION 
+ 1 CLKJ MONOSTABLE STATE MACH. 
+ 1 CLKJ DATA FLIP-FLOP 
+ 2 CLK. I WISHBONE RMW CYCLE 



= 6 CLKJ OVERHEAD PER 1 ECLK 

+ 1 ECLK FLIP-FLOP SET-UP PER 1 ECLK 



FURTHERMORE, IT IS RECOMMENDED THAT THE FREQUENCY OF [CLKJ] BE SUFFICIENTLY 
HIGHER THAN THE FREQUENCY OF [ECLK] TO ALLOW FOR A 
HOST PROCESSOR TO READ AND WRITE TO THE BITS AND 
RESPOND ACCORDINGLY. 

THE MAXIMUM RESPONSE TIME OF THE HOST PROCESSOR CAN BE FOUND THUSLY: 
1 6 

Tav = 1 ECLK_MAX_FF_SET-UP 

ECLK CLKJ 

FOR EXAMPLE: 

F, CLKJ = 33,000 MHz 
F, ECLK = 0.400 MHz 

15 1 

Tav = - = 2.32 uS 

0.4 MHz 33 MHz 33 MHz 

FOR MORE INFORMATION, SEE THE TIMING DIAGRAM. 

CEEPROM.VHD: HANDSHAKING CIRCUIT 
20 AUG 2002 



Figure 4-3 (a). Clock domain transformation circuit (con'f). 
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Figure 4-5. State diagram for EEPROM State Machine. 
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Figure 4-7. EEPROM read and write bvte timing. 
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4.2 MISCREG Entity 



The MISCREG entity contains all ££ ^£ 

CIBR address map. It contams two Jg^SSSSUto served by VMEbus, and 
CIBR SoC the 'A' side is connected to the wlSHWJNt also cQn _ 

the <B' side is connected to the Mea ^^^^ t dual 'A' and V ports, 
tains all the arbitration logic necessary for handling accesses rro 

f a. MKPRFG entity and Table 4-2 shows the 
SB&'MS handled * *e - in- 

elude: 

• DMC HW CONTROL . 

. CONFIG PROM CMD (see the CEEPROM entity) 

• CONFIG'WRITE DATA (see the CEEPROM entity) 
. CONFIgIrEAD^DATA (see the CEEPROM entity) 

• DMCCMD 

. DMCFAULT 

• DMC_STATUS 




WISHBONE 
MISCREG 

AADRJE ( ) BADRJO 
ADAT_I ( ) BDAT_I{) 
ADAT_0 ( ) BDATJD ( ) 
AWE J BWEI 
ASEL I() BSELJO 




ASTB_I 
ACYCJ 
AACKO 
AERR_0 
ABRSTJ 
->|>ABCLK_I 



BSTBJ 
BCYC_I 
BACKO 
BERR 0 



BLOCK DIAGRAM 



15 APR, 2002 



Fj gureizj Ml S rpFri block diagram. 



The MISCREG arbiter and arbiter timing are 



shown in Figures 4-9 and 4-10 respectively. 
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Table 4-2. WISHBONE DATASHEET for the MISCREG Entity 


General Description 


Logic for the lower seven registers in the sys- 
tem. Contains dual WISHBONE ports named 
'A' and 'B\ This datasheet applies to both in- 
terfaces. 


WISHBONE Revision Level 


B.2 


Supported WISHBONE Cycles 


SLAVE: READ/WRITE 


Data port, size 


32-bit 


Data port, granularity 


16-bit 


Data port, maximum operand size 


32-bit 


Data transfer ordering 


BIG ENDIAN or LITTLE ENDIAN 


Data transfer sequencing 


None 


Signal Description 


All WISHBONE signal names are identical to 
those defined in the specification, except that 
they have an 'A 5 or 'B' at the front. The 'A 5 
and 'B' refer to PORT A and PORTB respec- 
tively. 


Terminating Signals 


The WISHBONE interface on both ports sup- 
port [ACK_0] and [ERR O] terminating sig- 
nals. [ERRJ3] is generated when accesses to 
the unused/reserved registers are attempted. 
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'A' SIDE GRANT 




ARBITER EQOKnCNS 



AGNT 



BGNT 



INPUTS: BCYC_I, ACYC_I 
STATES: BGNT, AGNT 
CLOCK: BCLK 

'A' SIDE PRIORITY 



+ /ABRSri * /AGNT * /BCYC_I * ACYC_I, 

= /ABRST I * BGNT * /AGNT * BCYCI 

+ /ABRSRI * /BGNT * BCYC_I /ACYCJ, 



B' SIDE GRANT 

ARBITER STATE MACHINE 



15 APR, 2002 



Fi gure 4-9. M ISCREG arbiter. 
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Figure 4-10. MISCREG arbiter timing. 
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4.3 PCIWRAP Entity 

The PCWRAP entity is a wrapper between the ^^^^^4' 
the TCIWRAPc.VHD' file. 




\7 



E(tm) 
2t Only) 




VSTER 


11 


tVRAP' 
per Ent 


|g ! 


*S 


2f 


SHBO 


*2 




1 



WISHBONE 



5> 



Local 
System-on-Chip 



FPGA or ASIC Target Device 



20 SEP 2002 



,: r ..„ 4j i ^ hlnck dj a gant showjng the PHWR APc wrapper. 

Figure 4-12 shows a functional diagram of the PCIWRAPc entity. Each box in the diagram cor- 
responds to a VHDL process (as described below) of the tame name. 

The left side of the .** S — aTa SS £?S 

thesized, routed and connected to the PCI core 1 ne ^ Logi- 

PCI core and signal names. 

The PCI target interface responds to the following PCI commands (cycles): 

. Configuration Read (CBE[3:0] = 1010) 

. Configuration Write (CBE[3:0] = 101 1) 

. Memory Read (CBE[3:0] = 01 10) 

• Memory Write (CBE[3:0] = 0111) 

. Memory Read Multiple (CBE[3:0] = 1 1 00) 

. Memory Read Line (CBE[3:0] = 1110) 

The PCI wrapper supports BYTE granularity. However, the WISHBONE interface can be easily 
configured for WORD and DWORD granularity as well. 
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All PCI accesses to 'unused' address areas are terminated with a PCI TARGET ABORT. 

The wrapper circuitry connected to Xilinx LogiCore PCI interface controls how the target inter- 
face responds to the PCI commands. During the initial phase of a PCI burst cycle, a binary 
counter (located inside the wrapper circuitry) latches the starting PCI address. The counter in- 
crements during subsequent phases within the burst cycle, thereby generating the next address. 
The counter is incremented after every cycle that accesses the high byte of a 32-bit DWORD 
transfer. The high byte is indicated when [S_CBE(3)] is low. That means that the address 
counter is incremented after every 32-bit transfer or after a high order 16-bit transfer. 

PCI single and burst transactions are supported by the wrapper. However, burst transactions 21 
may not necessarily be supported by all memory locations or registers. If a burst transaction is 
attempted in a memory region that does not support burst transfers, then the memory should re- 
spond with the WISHBONE [ERR I] signal, which causes the wrapper to respond with a TAR- 
GET ABORT termination 22 . 

The wrapper is implemented as a PCI target, meaning that it cannot initiate PCI transactions. 

The right side of the block diagram shows the signals connected to the WISHBONE MASTER 
interface. The interface supports 8, 16 and 32-bit data transfers and a 32-bit address bus. Table 
4-3 is the WISHBONE DATASHEET that specifies the interface. The reader is directed to the 
WISHBONE specification, revision B.2 for a complete description of the WISHBONE MAS- 
TER and related signal names. 



21 Burst transactions are bus cycles with more than one data transfer phase. 

22 Burst transactions in excess of one data transfer are allowed as long as the memory structure supports it. This is 
because the core can be implemented on a number of target devices, memory types and speeds. The Xilinx Logi- 
Core PCI requires that memories must support single clock data transfers. If this type of memory is implemented on 
the target device, then the burst operation is supported. If this type of memory is not implemented on the target de- 
vice, then burst transactions are not supported. For more information please refer the Hardware Reference section of 
this manual. 
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J DIO THREESTATE 



si 

8§ 

a! 



LOGICORE PCI 
INTERFACE 



-> 



-> 



Q D I 



ADIO(3.1..10) 



Cft OQ 

Oj CO 
Oi M 

ID t3 



ADRQ1..10) 



ADIO(Q9..02) 



3 



COUNTER 
INC CONTROL 





ADR(09..02) 



-> 



& CO 



-> PCMD(3..0) 



SEL<1) 



GLOBAL CLOCK NET 



-> 



D Q 



WE 0 



DAT 1(31. .0) 



-> ADRO(31..02) 



-+ SEL_O{3..0) 



-> CLK 0 



-> RST_0 / 



WISHBONE 
' SYSCON 



WISHBONE 
INTERFACE 

XILINX LOGICORE PCI TO WISHBONE WRAPPER 
VHDL ENTITY: PCIWRAP 
06 AUG 2002 



Fianre 4-12. Functional diagram of PfJWRAPC entity. 
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Table 4-3. WISHBONE DATASHEET for the PCIWRAP Interface. 


General Description 


Wrapper between a 32-bit Xilinx Logi- 
CORE(tm) PCI interface and WISHBONE SoC 
interface. 


WISHBONE Revision Level 


B.2 


Supported WISHBONE Cycles 


MASTER: READ/WRITE 
MASTER: BLOCK READ/WRITE 


Data port, size 


32-bit 


Data port, granularity 


8-bit 


Data port, maximum operand size 


32-bit 


Data transfer ordering 


LITTLE ENDIAN 


Data transfer sequencing 


Sequential (burst) data transfers must be made 
from lower to higher addresses. The internal 
address counter rolls over whenever a most sig- 
nificant byte is transferred. 


Signal Description 


All WISHBONE signal names are identical to 
those defined in the specification. Refer to the 
signal descriptions for more details. 


Terminating Signals 


The WISHBONE interface supports all three 
termination signals: [ACK I], [RTY I] and 
[ERR_I]. See the text for more information 
about the operation of these signals. 
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4.3.1 PCIWRAP Timing Conversion 

signals. Diagrams for read and write cycles are shown. 
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Figure 4-13. READ timing between PCI and Xilinx PCI core back end signals. 
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Figure 4-15. PCI memory burst read cycle to WISHBONE block read. 
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4.3.2 Operation of the WISHBONE Termination Signals 

The PCIWRAP entity supports all three WISHBONE termination signals: [ACK_I], [RTYI] 
and [ERR_I]. These are translated to control signals used by the Xilinx LogiCORE(tm) PCI 
core. The wrapper responds differently to these signals, depending upon when they are asserted 
during the PCI bus cycle. 

Also note that the WISHBONE specification requires that only one termination signal be applied 
at any given time. Stated another way, [ACK I] and [ERR I] (or other combinations) cannot be 
asserted at the same time. Assertion of two or more acknowledge signals at any given time could 
result in an undefined operation. 



4.3.3 PCI Normal Termination Using [ACK_I] 

Figure 4-17 shows the timing diagram for a normal termination to a participating PCI TARGET 
cycle. There, the WISHBONE MASTER cycle starts in response to the assertion of 
BASE_HIT() by the Xilinx LogiCORE PCI (not shown). The PCIWRAPC wrapper then waits 
for the participating WISHBONE SLAVE to respond by asserting its [ACK_I] signal. 
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Figure 4-17. PCI normal termination. 
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, n m e„ry, me WISHBONE SLAVE can insertany ^^^^.TSWS 
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formed in response to the assert on of l^ C ^ ™^ availaMe t0 resp „„d immediately to 
arbitration has been complex, the SLAVE g Should be ^ g^AVE should only 

multiple assertions of [STB O] by the MASTb^ ^ (0 , he 

Onee the WISHBONE MASTER induce has £ 

[S_READY] signal ro the PCI con. This signa remams «rt *> t w 

bus cycle, regardless of the state of the ACKJ] sigML now • DISCONNECT W/O 

signals are asserted, then the wrapper w,ll generate either a PCI TAKGbl uir> 

DATA or a TARGET ABORT. 

During PCI burs, rransfers the [S.READY] signal the end ofrhe POtaj. 
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throttle the speed of the transfer during this time. 

4.3.4 PCI Target Retry Termination Using [RTY_I] 

u • j|,„,, m fnr a TARGET RETRY termination to a participating PCI 
Figure 4-18 shows the ^ starts in response to the assertion of 

TARGET cycle. There, the WISHBONb MAM tK y fof ^ _ 

BASE_H1T() (not shown) by the PCI ^ ^P^ s JSoNE SLAVE responds by as- 
pating WISHBONE SLAVE to respond. Normall y [RTY I], then the PCI- 

Lrting the [ACKJ] ^"^^^ 
WRAPC wrapper asserts [SJIbKMJ to tne r^y ^uic, 
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m [RTY_I] -> PCI TARffiT RETRY 
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Figure 4-18. PCI target retry termination. 



4.3.5 PCI Target Abort Termination Using [ERR_I] 

Figure 4-19 shows the timing diagram for a TARGET ABORT termination. There, the 
WISHBONE MASTER cycle starts in response to the assertion of BASE_HIT() (not shown) by 
the PCI core. The PCIWRAPC wrapper then waits for the participating WISHBONE SLAVE to 
respond. Normally, the WISHBONE SLAVE responds by asserting the [ACK I] signal. How- 
ever, if the WISHBONE bus cycle is terminated by asserting the [ERR I] signal, then the wrap- 
per asserts [S ABORT] to the PCI core. This results in a TARGET ABORT termination. The 
TARGET ABORT can be generated at any time during a single or burst transfer. 
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Figure 4-19. PCI target abo rt termination. 



4.3.6 VHDL Entity Reference 

The PCIWRAP entity is synthesized from a top 

m odule of VHDL code is named with a seven charter h ndle £ tf § g 

An additional (final) character is added to indicate it s use. A C 

VHDL circuit file, a 'T' character t^PCIWRAP' entity/architecture 

cates that it's a test vector file (with a .txt tile extension;, 
has the following names associated with it: 



Entity (circuit) name: jS™^ 

Architecture name: PClWKArci 

Entity/architecture filename: PCIWRAPC.VHD 

Test bench filename: ^Sl m 

Text vector filename: PC1WRAPV.TXT 
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4.3.7 Process: ADIO_THREESTATE 

The ADIO THREESTATE process generates a three-state output buffer on the 32-bit ADIO 
bus. 

4.3.8 Process: COMMAND_REGISTER 

The COMMANDREGISTER process creates the command register. The command register 
stores the state of the PCI command that is indicated on 'S_CBE()* during the initial (address) 
phase of every bus cycle. 

4.3.9 Process: COUNTER_INC_CONTROL 

The COUNTER_rNC_CONTROL generates the counter increment signal 'CINC. The counter 
is incremented at different times, depending on if it's a read or a write cycle. The type of cycle is 
determined from the TCMD(3..0)' register, which stores the state of the PCI command (on 
[C/BE[3:0]#). The counter is incremented only during Memory Read Multiple ([C/BE[3:0]# = 
B"l 100") and Memory Write ([C/BE[3:0]# = B"01 11") cycles. 

The counter is incremented after every cycle that accesses the high byte of a 32-bit DWORD 
transfer. The high byte is indicated when [S_CBE(3)] is low. That means that the address 
counter is incremented after every 32-bit transfer, after a high order 16-bit transfer and a high 
order 8-bit transfer. 



4.3.10 Process: CYCLE_CONTROL 

The CYCLE CONTROL process generates the WISHBONE [STB_0] and [CYC_0] signals. It 
also generates the [S READY], [S_TERM] and [S ABORT] signals to the Xilinx Logi- 
CORE(tm) PCI interface. The process uses the state machines and equations shown in Figure 4- 
20. 
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CYCLE CONTROL STATE MACHINE : 

STATES: CYC, FSTB 

INPUTS: PWC, PRC S_DATA, BASE_HIT 

EQUATIONS: 

PRC = /S WRDN * S_SRC_EN; 
PWC = SJfRDN * S__DATA_VLD; 



FSTB := /RST__I 
+ /RST_I 



CYC 



:= /RST_I 
+ /RST_I 
+ /RST__I 
+ /RST__I 



/CYC ■ 
CYC 

/CYC 
CYC 
• CYC 
' CYC 



/FSTB 
FSTB 

/FSTB 
/FSTB 
FSTB 



* /PWC * /PRC 



RASEJilT 
S DATA; 



/PWC 



* BASE_HIT 

* S_DATA 

* S_DATA 
/PRC * S DATA; 



CYCO = CYC; 

STB_0 = /RST_I ws ^ * sjrc EN 

+ /RST'I * SJWRDN * S_DATA_VLD; 



XXIX 



XXOX^' 



XXOX 
11XX 




r^xxxo 

UNUSED T 

RST I 



HANDSHAKING STAT* MACHINE (PART OF 'CYCU5 CONTROL'): 

STATES: ABORT, TERM, READY 

INPUTS : SDATA, FSTB, ERR_I, RTY_I, ACK_I 



EQUATIONS: 

S READY <= READY; 
S — TERM <= TERM; 
S — ABORT <= ABORT; 



1X01X n / 




RST_I & 
UNUSED STATES 



TNTTT*T CYCLE STATE MACHINE (PART OF 'CYCIE CONTROL') 



STATES: PCYC 

INPUTS: BASE_HIT, AER 



EQUATIONS: 

AER = ACK_I + ERR_I + RTY_I ; 

PCYC - /RST I * /PCYC * BASE HIT 
+ /RSTII * PCYC * /AER; 




ICYC = PYC * AER; 



RST I 



™ I^GICOEE PCI «™» S« 

06 AUG 2002 



Figure 4 



.on CYCLE CQNJRQI a " ri HANDSHAKING state machines. 
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4.3.11 Process: LOWER_ADDRESS_COUNTER 

The lower address counter generates the lower eight bits of the WISHBONE address bus. The 
counter is needed because the PCI core generates a base address during the first phase of a bus 
cycle. Subsequent PCI bus cycles do not generate any address information. The initial base ad- 
dress is preloaded into the counter. After that, each PCI cycle phase increments the counter. 

The counter generates the lower eight bits of the local address bus ADR(9..2). The upper 
twenty-four bits are held by a latch. Local address bus ADR(33..2) generates the WISHBONE 
address bus ADR_0(33..2). The counter and latch implement the PCI to WISHBONE address 
translation. The PCI interface uses a 32-bit address bus with four byte enables. This effectively 
allows 34-bit byte addressability. The WISHBONE interface has an 8-bit granularity. For this 
reason the PCI address bits are shifted up by two bits, and the byte enables are translated into the 
WISHBONE SEL_0() bits. 

The counter is designed in three sections, with the first two sections having a terminal count 
(TCNTX) bit. This reduces the number of 'and' terms in the equations of the higher counter bits. 

The counter design has been used in other projects, and represents a reasonable compromise be- 
tween speed and complexity. 

Important signals and their uses are: 

ADR(9..2): COUNTER OUTPUT (LOCAL ADDRESS BUS) 

ADIO(3 1 ..0): COUNTER DATA INPUT (USED FOR PRELOAD) 

CINC: COUNTER INCREMENT 

ADDR VLD: COUNTER PRELOAD 

TCNT2: TERMINAL COUNT FROM BIT 2 

TCNT5 : TERMINAL COUNT FROM BIT 5 



4.3.12 Process: SELECT_CONTROL 

The SELECT CONTROL process generates the local [SEL()] and WISHBONE [SEL_0()] sig- 
nals. 



4.3.13 Process: UPPER_ADDRESS_REGISTER 

The UPPER_ADDRESS_REGISTER process creates a 24-bit register, and captures the state of 
the upper twenty-four address lines coming from ADIO(31..08). These remain static throughout 
the entire PCI bus cycle, regardless of the type of PCI bus cycle. The lower eight bits are cap- 
tured by the address counter. 
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Important signals include: 

ADR(33 10)- REGISTER OUTPUT (LOCAL ADDRESS BUS) 
ADIO(3 1 ..8): REGISTER DATA INPUT 
ADDR_VLD: REGISTER LOAD 



4.3.14 Process: WISHBONE_SYSCON 

The WISHBONE SYSCON process generates the WISHBONE SYSCON signals [CLK_0] and 
[RST_0]. The [CLKJ] and [RSTJ] signals should be tied to the [CLK_0] and L KM_uj g 
nals outside of this entity. 

[RST_0] is a synchronized version of the asynchronous [RST] signal generated by the Xilinx 
LogiCORE. 



4.3.15 Process: DI_REG 



• Moto in' wakter It's onlv purpose is to overcome some timing prob- 
TheDI_REG process is a data in register, it s oniy P*^P LogiCORE PCI. If this register 
lems associated with the three-state buffers ^J^^^^L, the data in port 
is removed the router attempts ta ^^^^^ AT 0()] and then on to 
rr>AT ini through the three-state buffers, back to the data out pon y J , 
SS^mtnents. This timing path is broken when the regtsters ate added. 
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4.4 SEMABUD Entity 

The SEMABUD entity is a shared memory. It is very similar to the SEMABUF entity, except 
that it implements the memory buffer with Xilinx Spartan 2 distributed RAM instead of Block- 
Select+ RAM. Furthermore, this entity responds in one clock cycle (the SEMABUF entity re- 
sponds in two cycles). For more information please see the 'SEMABUF' entity described else- 
where in this manual. 
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4.5 SEMABUF Entity 

The SEMABUF entity is a shared memory buffer As ^Ig^JfiSfPSl 
Figure 4-22. 




WISHBONE 
SHARED MEMORY 
SEMABUF 

AADR_I() BADRIO 

ADATJO BDATJO 

ADAT_0() BDAT_0 ( ) 
AWE_I BWE J 

ASEL_I() BSELJO 
ASTBJ BSTB_I 
AACK_0 BACK_0 
ACYC I BCYC_I 



ABRSTJL 
>t>ABCLK_I 



'32 



s 



CO 



BLOCK DIAGRAM 

10 APR, 2002 

Fjgurg 4-21. SEMABUF bl ock diagram. 



The buffer operates as an independent shared memory . This — £. 

cry supports full, simultaneous, read/wr, te pnvfc g* ; .nto ea ch ^ ^ fin ? hed its opera . 

is controlled by an internal arbiter circuit. 

Table 4-4 gives the specifications for the WISHBONE ports. 

, j . .„,__ p ORT a or PORT B gains access to the shared memory 
^T^tZ^ZZ^L^ wfth a state diagram shown in Figure 
4-23, with the related timing diagram shown in Figure 4-24. 
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WISHBCNE ACKNOWLEDGE SIGNALS 



MCK_0' = ARDY * ASTB_I; 

BACK 0 = BGNT * BSTB I; 



RAMB4_SgJS# 





WEA 




ENA 




RSTA 




>CLKA 




ADDRAtftO] 




DIAJS.-O] 




□ 




WEB 




ENB 




RSTB 




> CLKB 




ADDRBI*:0] 




DIB[#:0] 



DOA[#:0} 



DOB(tf:Oj 



Figure 1: Dual-port Block SelectRAM* Memory 

XILINX SelectRAM+ CONNECTION DIAGRAM 



XILINX SPARTAN 2 DUAL-PORT SelectRAMf EQUATIONS 

PORTA - BANK #0 

WEA = AWE I; 

ENA = ARDY * ASTB_I * ASELJ (0) ; 

RSTA = GND; 

CLKA = ABCLKJ; 

ADDRA(7..0) = ADDR_I(8..1); 

DINA(15.. 0) = ADAT 1(15. .0); 
ADAT_O(15..0) = DOA(T5..0); 

PORTB - BANK #0 

WEB = BWE_I; 

ENB = BGNT * BSTB_I * BSEL_I(0); 

RSTB = GND; 

CLKB = ABCLK_I ; 

ADDRB(7..0) = BDDR_I(8..1); 

DINB(15..0) = BDAT I (15. .0) ; 
BDAT_O(15..0) = DOB(T5..0); 

PORTA - BANK #1 

WEA = AWE_I; 

ENA = ARDY * ASTB_I * ASEL_I(1); 

RSTA = GND; 

CLKA = ABCLK_I; 

ADDRA(7..0) = ADDR_I(8..1); 

DINA(15..0) = ADAT 1(31.. 16); 
ADAT_O(15..0) = D0A(7l..l6); 

BANK #1 



PORTB 
WEB 
ENB 
RSTB 
CLKB 

ADDRB(7..0) 
DINB(15..0) 



= BWE I; 

= BGNT * BSTB_I * BSEL_I(1); 

= GND; 

= ABCLKJ; 

= BDDR_I(8..1); 

= BDAT I (31. .16) ; 



BDAT 0(15.. 0) = DOB (71.. 16); 



29 SEP 2002 



Figure 4-22. Implementation details for the Xilinx BlockSelect+ RAM. 
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Tn u,. a a WISHBONE DATASHEET for the SEMABUF Enjit 



General Description 



WISHBONE Revision Level 



Supported WISHBONE Cycles 



Data port, size 



Data port, granularity 



Data port, maximum operand size 



Data transfer ordering 



Data transfer sequencing 



Signal Description 



256 x 32-bit shared memory buffer with two 
WISHBONE SLAVE interfaces. This datasheet 
applies to both interfaces. 



B.2 



MASTER: READ/WRITE 
MASTER: BLOCK READ/WRITE 



32-bit 



16-bit 



Terminating Signals 



32-bit 



BIG ENDIAN or LITTLE ENDIAN 



None 

All WISHBONE signal names are identical to 
those defined in the specification, except that 
they have an 'A' or 'B' at the front. The 'A' 
and 'B' refer to PORT A and PORTB respec- 

tively. 

The WISHBONE interface on both ports sup- 
port only the [ACK_0] terminating signal. 
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ABRST I 



INPUTS: BCYC_I, ACYC_I 
STATES: BGNT, AGNT 
CLOCK: BCLK 

'A' SIDE PRIORITY 



ARBITER BgQMUCNS 

AGNT = /ABRST I + /BGNT * ACYC_I 

+ /ABRST J + /BGNT + /BCYC_I + ACYC_I 

+ /ABRST_I * /AGNT * /BCYCJL * ACYC_I; 

BGNT = /ABRST I * BGNT + /AGNT * BCYC_I 

+ /ABRST"! * /BGNT + BCYCJ + /ACYC_I; 



SEMABUF ARBITER STATE MACHINE 



10 APR, 2002 



Figure 4-23. SEMABUF arbiter state machine. 
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4.6 SEMAREG Entity 

The SEMAREG entity provides a one-bit semaphore to two WISHBONE SLAVE interfaces. 
The interfaces are called 'A' and 'B\ The semaphore is intended to be used for shared memory 
buffers where one interface or the other must gain access to the memory. The semaphore does 
not lock the memory buffer (which is located elsewhere) but just provides a mechanism for sys- 
tem software to determine if the particular memory buffer is being used. Figure 4-25 shows a 
block diagram of the entity. 
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Figure 4-25. SEMAREG block diagram. 



The semaphore is accessed by reading the bit. If the bit is returned as '0', then the WISHBONE 
MASTER (usually a processor) accessing the device is granted the semaphore. If the bit is re- 
turned as '1\ the buffer is busy. If a processor obtains the buffer by reading '0' (becomes the 
owner), and the bit is sampled for a second time, then the bit is returned as T on the second ac- 
cess. 

If both ports attempt to access the semaphore t the same time (i.e. during the same clock cycle), 
then port 'A' access has priority. 

The buffer is released by writing a '0' to the semaphore. Writing a T to the bit does not have 
any effect. The semaphore may be released from either port. 

Table 4-5 shows the WISHBONE DATASHEET for the SEMAREG entity. 
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Table 4-5. WISHBONE DA 



General Description 



WISHBONE Revision Level 



ASHEET for the SEMAREG Entit 

32-bit semaphore register with WISHBONE 
SLAVE ports ('A' and 'B'). This datasheet ap- 
plies to both interfaces. 



B.2 



Supported WISHBONE Cycles 



MASTER: READ/WRITE 



Data port, size 



32-bit 



Data port, granularity 



Data port, maximum operand size 



16-bit 



32-bit 



Data transfer ordering 



BIG ENDIAN or LITTLE ENDIAN 



Data transfer sequencing 
Signal Description 



Terminating Signals 



None 

All WISHBONE signal names are identical to 
those defined in the specification, except that 
they have an 'A' or 'B' at the front. The 'A' 
and 'B' refer to PORT A and PORTB respec- 

tivel) 

The WISHBONE interface on both ports sup- 
port only the [ACK_0] terminating signal. 



78 



The circuit arbiter determines whether PORT A or PORT B gains access to the shared memory 
buffer. The arbiter is composed of a two bit state machine, with a state diagram shown in Figure 
4-26. The timing diagram is shown in Figure 4-27. 



STATES: AGNT 
INPUTS: AREQ, GNT 
CLOCKS: BCLK 

'A' SIDE PRIORITY 

EQUATIONS: 

AGNT : = /ABRST I * AREQ * /GNT 
+ /ABRST - ! + AREQ * AGNT; 



STATES: GNT 
INPUTS: REQ, REL 
CLOCKS: BCLK 



EQUATIONS: 

GNT := /ABRST I * /GNT * REQ 
+ /ABRST J * GNT * /REL; 

REQ = AREQ + BREQ; 
REL = AREL + BREL; 



STATES : BGNT 
INPUTS: MBREQ, GNT 
CLOCKS: BCLK 

'A' SIDE PRIORITY 

EQUATIONS: 



BGNT := /ABRST_I * MBREQ + /GNT 
+ /ABRST_I * MBREQ * BGNT; 

MBREQ = /AREQ * BREQ; 




ABRST_I • 

AGNT STATE MACHINE 



ABRST I 




ABRST I 




GNT STATE MACHINE 



BGNT STATE MACHINE 



OTHER EQUATIONS: 

AREL = ASTB_I * AWE_I * ASEL_I(0) * /ADAT_I(0); 

AREQ = ASTB_I * /AWE_I * ASEL_I(0); 

SAACK := ASTB_I; 

AACKO = SAACK * ASTBJ; 

BREL = BSTB_I * BWE_I + BSEL_I(0) + /BDAT_I(0); 

BREQ = BSTB_I + /BWE_I * BSEL_I{0; 

SBACK := BSTB I; 

BACK 0 := SBACK * BSTB I; 



30 MAY, 2002 



Figure 4-26. SEMAREG state machines. 



The semaphore is formed from a series of three, one-bit state machines called 'GNT', 'AGNT' 
and 'BGNT'. The 'GNT' state machine indicates if the semaphore has been granted or not. 
Semaphore accesses from either side of the entity will cause the semaphore to be set. The other 
two state machines determine if one side of the entity or the other is granted the semaphore. 

For example, if the semaphore is not granted, and a semaphore request occurs from the ' A' side, 
then the 'GNT' state machine bit will transition from '0' to T. At the same time the 'AGNT' 
state machine will also transition from '0' to T. This grants the semaphore to the 'A' side. 
Subsequent accesses from the 'A' side will not grant the semaphore, as it is designed so that the 
semaphore is only granted during the first read access. 

Once the semaphore has been granted, neither side can obtain it until the semaphore is released. 
The release can come from either port of the interface. 
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4.7 VMEcore™ Entity 

The VMEcore™ entity is a A24:D32:D16:D08(O) VMEbus SLAVE to WISHBONE MASTER 
bridge. As shown in Figure 4-28, the bridge allows connection to a local SoC interconnection. 
The entire interface is synthesized as the 'VMECOREc' VHDL entity. Table 4-6 shows the 
characteristics of the WISHBONE interface. 



5 



\7 



II 



WISHBONE 



Local 
System-on-Chip 



FPGA or ASIC Target Device 



23 SEP 2002 



Figure 4-28. Block diagram of the VMEcore™ interface. 



Features of the VMEcore(tm) interface include: 

o A24:D32:D16:D08(EO) VMEbus slave interface, 

o Compact design minimizes gate count and per-unit costs, 

o Enforces VMEbus interface rules and timing, 

o 32-bit WISHBONE MASTER (backend) interface. 

o Allows fabrication of a complete VMEbus SLAVE on a single chip (with peripherals), 

o Provided as VHDL source code. 
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Tlll ,„ \ t wrgHBONF DATASHEET for the VMEcore(tm) 



General Description 



WISHBONE Revision Level 



Supported WISHBONE Cycles 



Data port, size 



Data port, granularity 



Data port, maximum operand size 



Data transfer ordering 



Data transfer sequencing 



Signal Description 



Backend interface for a VMEbus SLAVE IP 
. Operates as a WISHBONE SoC interface. 



core 



Optional [ERRJ] support 



B.2 



MASTER: READ/WRITE 
MASTER: RMW 



32-bit 



8-bit 



32-bit 



BIG ENDIAN 



UNDEFINED 



All WISHBONE signal names are identical to 
those defined in the specification. Signal 
[A24SGL_0] is a tag with TAG TYPE: 
[TGA O]. Refer to the signal descriptions for 
more details. 



WISHBONE cycles terminated with [ERRJ] 
terminate VMEbus cycles with BERR*. 
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4.7.1 VMEbus SLAVE Interface Signals 

All VMEbus slave signal names have the '_V or '0' characters attached to them. These indi- 
cate if the signals are an input (to the core) or an output (from the core). For example, [ACK_I] 
is an input and [ACK_0] is an output. This convention is used to clearly identify the direction of 
each signal. 

Signal arrays are identified by a name followed by a set of parenthesis. For example, [DAT_I()] 
is a signal array. Array limits may also be shown within the parenthesis. In this case the first 
number of the array limit indicates the most significant bit, and the second number indicates the 
least significant bit. For example, [DATJ(31..0)] is a signal array with upper array boundary 
number thirty-one (the most significant bit), and lower array boundary number zero (the least 
significant bit). The array size on any particular core may vary. In many cases the array bounda- 
ries are omitted if they are irrelevant to the context of the description. 

When used as part of a sentence, signal names are enclosed in brackets '[ ]\ This helps to dis- 
criminate signal names from the words in the sentence. 

The VMEbus interface signals can be directly connected to the target device, or routed through 
buffer chips. Buffer chips are generally used on the VMEbus interface because FPGA and ASIC 
target devices usually do not have compatible inputs and outputs. VMEbus is based on TTL 
interface standards, and regulate 24 : 

o Noise margins 

o Load current (inputs) 

o Output short circuit current 

o Input clamp voltage 

o Capacitive loading 

o Output current 

o Backplane impedance 

VAJO 

VMEbus address input signal array [VA_I()]. Connect these signals to the corresponding VME- 
bus address lines AO 1 - A3 1 (either directly or through a buffer). 



VAM_I() 

VMEbus address modifier input signal array [VAM_I()]. Connect these signals to the corre- 
sponding VMEbus address modifier lines AMO - AM 5 (either directly or through a buffer). 



24 For more information, the reader is directed to the ANSI/VITA 1-1994 standard. 
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VD - 1 ^ . • i r\m tai Connect these signals to the corresponding VMEbus 
VMEbus data input signal array [VDJOJ- Connect mese sign* 

data lines D00 - D3 1 (either directly or through a butter). 

VD_O0 Tnnnect these signals (usually through a buffer) to 

~?vf senera " y connec,ed throu8h a 

buffro prevideUcient drive current to the VMEbus backplane. 

VNAS- 1 , . • , r\/MA<; n Connect this signal to the VMEbus [AS*] sig- 

^^Z^T^L^'^ and [AS-, are active iow 

signals. 

VNDSOJ ,rvun<u. Tl Connect this signal to the VMEbus [DSO*] signal 

XW-l^Sh PSO 8 ., and [VNDSOJ, are active iow 



VNDS1J . • .rxnvmci n Connect this signal to the VMEbus [DS1*] signal 

sssas a^^wss - <™ - — - 

signals. 

VNDTACK__0 rv/xmTArK Ol Connect this signal to the 

VMEbus data transfer acknowledge output signal ^^-^T^ide sufficient cur- 
VMEbus [DTACK*] signal. It is generally connecte d ^^^^rmXTACK O] are 
rent drive to the VMEbus backplane. Also note that both [DTACK j and [vinu 

active low signals. 

VNIACK_I , nruiArv ti Connect this signal to the VMEbus 

[VNIACKJ] are active low signals. 
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VNLWORDJ 

VMEbus long word input signal [VNLWORDJ]. Connect this signal to the VMEbus 
[LWORD*] signal (either directly or through a buffer). Also note that both [LWORD*] and 
[VNLWORD J] are active low signals. 



VNSYSRESET I 

VMEbus system reset input signal [VNSYSRESET I]. Connect this signal to the VMEbus 
[SYSRESET*] signal (either directly or through a buffer). Also note that both [SYSRESET*] 
and [VNSYSRESET I] are active low signals. 



VN WRITE J 

VMEbus write input signal [VNWRITEJ]. Connect this signal to the VMEbus [WRITE*] sig- 
nal (either directly or through a buffer). Also note that both [WRITE*] and [VNWRITEJ] are 
active low signals. 



VSAC24J0 

A24 VMEbus SLAVE address compare input signal array [VSAC24J0]. This array determines 
when the SLAVE interface is selected by a VMEbus cycle. It is used by the local address com- 
parator to decode the destination address of a bus cycle. They may be connected to a dip-switch 
or latch on the board which holds the base address of the SLAVE. 



VSAE24 I 

The A24 VMEbus SLAVE address enable input signal [VSAE24 I] enables the SLAVE inter- 
face. In most cases it should be permanently asserted (i.e. tied high). However, in some cases 
this signal is useful if the interface needs to be disabled. 



VTSTJ 

The simulation test input signal [VTSTJ] forces all self-starting counters and other devices in 
the VMEbus interface to an initial state. It is used for test simulation purposes, and should be 
negated (i.e. tied low) during normal core operation. 
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4.7.2 WISHBONE MASTER Interface Signals 



core is participating in the cycle. It conforms to WISHBONE TAO L _ 

cases this signal is superfluous and can be ignored. 

Sic input ICLKJ] -dinates al, activi^ 

interconnect. All WISHBONE output signals ^ registered at the sing g 
WISHBONE input signals are stable before the rising edge of [CLK_I]. 

DAT - 10 • rn a t tai k ,Ked to oass binary data. The array boundaries are determined 

The data input array [DATJO is used to pass omary j i( 6 3..0)]). Also see the 

by the port size, with a maximum port size of 64-bits (e.g. [DAi_ito w 
[DAT_0()] and [SEL_0()] signal descriptions. 

^output array [DAT_0()] is used to pass bjjj, jd-J- ^g^S^Z 
mined by the port size, with a maximum port size of 64-b.ts (e.g. [DAijto w 
[DATJO] and [SEL_0()] signal descriptions. 

RST - 1 Tn * >u uncuRONF interface to restart. Furthermore, all internal 

The reset input [RSTJ] forces he WISHBONE mtertace tore resetg ^ 

self-starting state machines will be forced ^^»^^ m ^ g (aWl0U ^ it may be 
WISHBONE interface. It is not required to reset other parts or an 

used that way). 

TGDJO M . , A4 A ctpr and SLAVE interfaces. It contains information 

Data tag type [TGDJO] is used on MAS TER jmd . s ^ " rf , [STB T] . For examp le, 
that is associated with a data lines [DAT JO], and £™ ^ g [ 0 the data bus. 

parity protection, ^^^1*^*7 

*» name and operation of a data tag must be de- 

fined in the WISHBONE DATASHEET. 

TGD_O0 . X4ACTPR and ST AVE interfaces. It contains information 
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bus cycle) is pre-defined by this specification. The name and operation of a data tag must be de- 
fined in the WISHBONE DATASHEET. 



ACK_I 

The acknowledge input [ACK_I], when asserted, indicates the termination of a normal bus cycle. 
Also see the [ERRJ] and [RTY J] signal descriptions. 



ADRJ)() 

The address output array [ADR_0()] is used to pass a binary address. The maximum size of the 
array is specified as [ADR_O(63..0)]. However, the higher array boundary is specific to the ad- 
dress width of the core, and the lower array boundary is determined by the data port size (e.g. the 
maximum array size on a 32-bit data port is [ADR_0(63..2)]. In some cases (such as FIFO inter- 
faces) the array may not be present on the interface. 



CYC J) 

The cycle output [CYC_0], when asserted, indicates that a valid bus cycle is in progress. The 
signal is asserted for the duration of all bus cycles. For example, during a BLOCK transfer cycle 
there can be multiple data transfers. The [CYC_0] signal is asserted during the first data trans- 
fer, and remains asserted until the last data transfer. The [CYCO] signal is useful for interfaces 
with multi-port interfaces (such as dual port memories). In these cases, the [CYC_0] signal re- 
quests use of a common bus from an arbiter. Once the arbiter grants the bus to the MASTER, it 
is held until [CYC_0] is negated. 



ERRJ 

The error input [ERRJ] indicates an abnormal cycle termination. The source of the error, and 
the response generated by the MASTER is defined by the IP core supplier. Also see the 
[ACK_I] and [RTYJ] signal descriptions. 



RTY_I 

The retry input [RTYJ] indicates that the interface is not ready to accept or send data, and that 
the cycle should be retried. When and how the cycle is retried is defined by the IP core supplier. 
Also see the [ERRJ] and [RTYJ] signal descriptions. 

SELJ)() 

The select output array [SEL JD()] indicates where valid data is expected on the [DATJO] signal 
array during READ cycles, and where it is placed on the [DAT OQ] signal array during WRITE 
cycles. The array boundaries are determined by the granularity of a port. For example, if 8-bit 
granularity is used on a 64-bit port, then there would be an array of eight select signals with 
boundaries of [SEL_O(7..0)]. Each individual select signal correlates to one of eight active bytes 
on the 64-bit data port. For more information about [SEL OQ], please refer to the data organiza- 
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to section in Chapter 3 of this specification. Aiso see the [DATJO], [DAT.OO) and [STB_0, 
signal descriptions. 

STB -° • j • va transfer cvcle It is used to quality various 

The strobe output [STB.O] mdtcares a vahd dtt tmfe cycle. H _ 

Iddre-X type [TGA.OOl connrius bta-JM—* tSZtf^Z 
WISHBONE DATASHEET. 

^ G ?-P°. rxrr OOl contains information associated with bus cycles, and is qualified by 
Cycle tag type [TGC_0()J contains imonn acknowledge and cache control cycles can 

signal [CYCO]. For example data transfer, "^^^ d t0 discrim inate between 

^un^L^^ 

WE -° u, ♦ t rwF ni indicates whether the current local bus cycle is a READ or 
The write enable output [WE_0] indicates ™ne and is asserted during WRITE cycles. 
WRITE cycle. The signal is negated during READ cycles, and is assenea g 

4 7.3 VHDL Synthesis and Test 

.Ke VMBcore is or^ed * s a aentsof VHDL ^^Z^T^l 
top level entity known as 'VMECORbC.vtiu . nguic 
30 shows a functional diagram for the core. 

Each m „du,e of VHDL code is named £££ £££ 

rii, A :«;t1^ 
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Entity (circuit) name: 
Architecture name: 



VMECOREC 
VMECOREC1 



Entity/architecture filename: VMECOREC.VHD 
Test bench filename: VMECORET.VHD 
Text vector filename: VMECOREV.TXT 

Only the top level module in the hierarchy includes a VHDL test bench. It is assumed that all of 
the files are simulated and synthesized as a group. 



VMECOREC.VHD 








BUSSCNTC.VHD 






LOCIADRC.VHD 






LOCLDATC . VHD 






SLAVCNTC.VHD 






SLAVDECC.VHD 






VMEBADBC.VHD 



23 SEP 2002 

Figure 4-29. VMEcore entities. 



The entire set of VMEcore™ entities is created with a parametric core generator called the 
VMEbus Interface Writer™. The tool itself is a trade secret of Silicore Corporation, and is not 
available under any license. However, the output files provided as VMEcore entities, test 
benches and test vector files are fully readable with standard editors. They may be easily and 
fully modified and maintained without the parametric core generator. 
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VD 0(31..16) <• 



DAT 1(31.-24) 



VD I (31.. 00) 



>DAT_O{31..00) 



■>ADR_0(18..2) 



VA I (23.. 01) 




VAM_I(5..0) 
VNIACK I 



> A24SGLO 



SAC24_I(23..19) 
SAE24_I 



>SEL_0() 



VNDTACK_0 
VNBERR 



WISHBONE INTEREACE 



VMEbus INTEREACE 



Fi a„ rP Fun^tinn.l digram of the VMF.C OREC entity. 
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4.7.4 Bus Interface Timing 

Before routing the VMEcore(tm) interface, correct timing specifications must be entered into the 
design. Furthermore, these timing specifications must take into account not only the require- 
ments of the VMEcore design, but also of any external driver or receiver chips. These external 
chips are needed because FPGA and ASIC target devices aren't compatible with the electrical 
characteristics of the VMEbus backplane. 

The VMEcore logical design has only three simple timing constraints: 

o Input-to-clock setup time: 1 [VCLK] 
o Flop-to-flop transition: 1 [VCLK] 
o Clock-to-output delay time: 1 [VCLK] 

The input-to-clock setup time can be analyzed with Figures 4-3 1(a) and 4-32(a). This timing pa- 
rameter includes the time it takes for a VMEbus backplane signal to propagate through an input 
buffer, through the input pin of the target device and then to set up at the input of a flip-flop. 

For example, consider a VMEbus input signal that traverses through an input buffer that has a 
timing delay [Tplh] of 7.0 ns. Furthermore, assume that [VCLK] operates at a clock speed of 
50.000 MHz (or a period of 20.0 ns). This means that the target device must be routed with a 
worse case input-to-clock delay of: 

Tpdlhin(max) = Tvclk(min) - Tphl 

Tpdlhin(max) = 20.0 ns - 7.0 ns = 1 3.0 ns 

Although these numbers are given for the 'low-to-high ' transition delay, a similar case exists for 
the 'high-to-low' transition delay. The method by which this time specification is entered into 
the software development tools depends upon the target technology and router used. For exam- 
ple, the timespec for the VMEbus data strobe [N_VDS0] input would be entered into a Xilinx 
FPGA router thusly: 

NET "N VDS0" OFFSET = IN 13 ns BEFORE "VCLK"; 
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VMEbus 
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Tpdhlindnax) 
Tpdlhin(max) 



-0 
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BUFFER 



VCLK 
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Tpdhlout(max) - lh 
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Tpoe 



D , Q 
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Target Device 
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(a) VMEbus Input & Output Signals 



VMEbus 
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TpdhloutOnax) j 
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(b) VMEbus Bi-directional Signals 



23 SEP 2002 



Figure 4z3L De tennjning hus interfere tim in g characteristics. 
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VMEbus Signal 



Buffered VMEbus 
Input Signal 

VCLK (min) 
(Internal) 



Tphl -> 


Tvclk (min) 


Tplh 


Tvclk (min) 




1 


u Tpdhlin (max) 




, Tpdlhin (max) 
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! \ 
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(a) VMEcore Input Timing Constraints 



Device input to VCLK setup, high-to-low transition: Tpdhlin (max) = Tvclk (min) - Tphl 
Device input to VCLK setup, low-to-high transition: Tjxilhin(max) = Tvclk(min) - Tplh 



Device Output' 
Signal 



Buffered ' 
VMEbus Signal 



VCLK(min) AjjT 
(Internal) 



Tvclk (min) 
Tpdhlout (max) 



1 



• Tphl 



Tvclk (min) 
Tpdlhout (ma>:) 



J 



-Tplh 



(b) VMEcore Output Timing Constraints 

VCLK to device output, high-to-low transition: Tpdhlout (max) = Tvclk (min) - Tphl 
VCLK to device output, low-to-high transition: Tpdlhout (max) = Tvclk (min) - Tplh 
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Figure 4-32. VMEbus interface timing. 
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move from the output of one fl.p-flop and be sot-up atfte nput « 4 . 
^""nVctK^a, 5 So MH^'men Win)! is ,/VCLK, or 20.0 
ns. It would be entered into a Xilinx FPGA router thusly: 

NET "VCLK" PERIOD = 30.000; 

There are additiona, contain* place on WW** ™%^££P^ 
VMEbus is asynchronous it must be sampled 333 Mte or a period of 30.0 ns). 

strobe skew on the backplane. 



DSA* 



DSB* 



30 nS (min) 



VCLK (min) f<L 



J 



DSA* 



DSB* 



VCLK (max) 





20 nS (max) . 




! i 


\ 


/ 




L_ / 




15) \ > 





Ad 



(b) at VCTK (max) .50.000 MHz) « LEAST ONE ASSERTM jMPLE^^ 

— JM 24, 2002 

Firv , 0 4,33 VMFhus con ^int. place on ■ VCT ,K1 frequency. 



Register-transfer-logic. 
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The clock-to-output delay time is the time it takes for a synchronous signal to exit a flip-flop, 
propagate through the FPGA or ASIC target device, an output buffer, and then arrive at the 
VMEbus signal interconnection. 

For example, consider a VMEbus input signal that traverses through an output buffer that has a 
timing delay [Tplh] of 10.0 ns. Furthermore, assume that [VCLK] operates at a clock speed of 
50.000 MHz (or a period of 20.0 ns). This means that the target device must be routed with a 
worse case clock-to-output delay of: 

Tpdlhout(max) = Tvclk(min) - Tphl 

Tpdlhout(max) = 20.0 ns - 10.0 ns = 10.0 ns 

Although these numbers are given for the 'low-to-high' transition delay, a similar case exists for 
the 'high-to-low' transition delay. The method by which this time specification is entered into 
the software development tools depends upon the target technology and router used. For exam- 
ple, the timespec for the VMEbus data line [VD<0>] output would be entered into a Xilinx 
FPGA router thusly: 

NET "VD<0>" OFFSET = OUT 10 ns AFTER "VCLK"; 



The timing diagram of Figures 4-34 and 4-35 show how VMEbus read and write cycles are 
translated to WISHBONE read and write cycles. 
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VMEcore (tm) TIMING - VMEbus TO WISHBONE SINGLE WRITE CYCLE 
PARTICIPATING VMEbus SLAVE W/NORMAL TERMINATION 
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Figure 4-35. VMEbus to WISHBONE write cycle translation. 
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4.7.5 BUSSCNTC Entity 

The BUSSCNTC (bus controller) entity renames signals used in the core. 

4.7.6 LOCLADRC Entity 

The LOCLADRC entity ^ « 
WISHBONE address bus [ADR_O(18..02)j. ine upp h , , SoC address 

encoded into select signals [SEL_O(3..0)]. 



4.7.7 LOCLDATC Entity 

The LOCLDATC entity provides two functions: (a) it multiplexes VMEBns data to the corree. 
WISHBONE data bank and (b) it latches the data. 



4.7.8 SLAVCNTC Entity 

DTACK*, etc.) and other functions. The SLAVONIC emuy muu 
processes: 



• DCYC process 

• DTACK_GENR process 

• SEL_GENR process 

• SEQU process 

• STROBES process 

• SYNC process 

• TERMINATOR process 



The DCYC (data cyele ^rTZ^W™ 



This in- 
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The [VNDTACK_0] signal is connected to DTACK* on the VMEbus interface. During write 
cycles [VNDTACKJ3] is asserted one [CLKJ] edge after WISHBONE acknowledge [ACKJ] 
is asserted. During read cycles the interface waits for an extra clock cycle before asserting 
[VNDTACK_0]. This provides extra time to route data through the target device. If the 
WISHBONE [ERR I] signal is asserted instead of [ACKJ], then the [VNBERRO] signal will 
be asserted instead of [VNDTACK O]. This indicates that an error occurred during the cycle, 
and causes the VMEbus [BERR*] signal to be asserted. 

The SEL_GENR process drives the WISHBONE MASTER [SEL_O(3..0)] data select signal ar- 
ray. The signal array is used for bank select lines during data transfers. Figure 4-36 shows how 
the data select signals are asserted during VMEbus transfers. 

All of the select signals are asserted during the data load portion of a participating SLAVE cycle. 
This is indicated by the local [DLOAD] signal, which is generated by the SEQU process. Fur- 
thermore, the assertion of an individual signal in the [SEL_OQ] array depends on the type of 
VMEbus data cycle (D32 or D16(EO)) and the address of the transfer. If an individual signal is 
selected it remains asserted until the cycle is terminated (using [ACKJ], etc.), or when a VME- 
bus cycle is aborted (indicated by the negation of VMEbus [DSO*] or [DS1*]. 



VMEcore(tm) WISHBCNE MASTER Data Bus Routing - IDSIZE:32 


Cycle 


VMEbus Signals 
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2(1) | 
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2(2) | BYTI 
.D08 D07. 


:<3) | 

.DOO 



NOTES: EVEN - DS1* asserted during D16 or even byte transfer. 

ODD - DSO* asserted during D16 or odd byte transfer. 
LW* - VMEbus LWORD* Signal 
(DB) - Carries Data Bit . 
SEL_X() - For SLAVE or IREQ: 'X* => 'I 1 ; for MASTER or I HAND: 'X' => '0\ 



23 SEP 2002 

Figure 4-36. Data select signals during VMEbus transfers. 

The SEQU process is a state machine (sequencer) that generates master timing for the core. Fig- 
ures 4-37 and 4-38 show the state and timing diagrams for the process. 
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38 Tinnn, diagram (relativ glfor the ^FOI J process state machine. 
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The STROBES process drives the WISHBONE [CYC J)], [STB_0] and [WE J)] signals. For 
more information about these signals please refer to the WISHBONE specification. 

The SYNC process synchronizes the VMEbus AS*, DSO* and DS1* signals. It generates local 
signals [AS], [DS] and [SLDS]. 

The TERMINATOR process drives a common local signal [ACK] in response to the assertion of 
[ACKJ] or [ERR_I]. In general, either of these signals can be used to terminate a WISHBONE 
bus cycle. 

4.7.9 SLAVDECC Entity 

The SLAVDECC entity decodes the upper five bits of the VMEbus address bus and the address 
modifier code. When a participating VMEbus interface is selected, the entity asserts the address 
compare signal [ADC]. The circuit responds to address modifier codes 0x3E, 0x3D, 0x3 A and 
0x39. 

The upper five VMEbus address bits [VAJ(23..19)] are compared to local address compare bits 
[SAC24_I(23..19)]. When these match (along with the address modifier codes), the interface is 
selected for a bus cycle. 

The interface may be enabled or disabled with signal [SAE24J]. If the interface is to be perma- 
nently enabled, tie [SAE24J] to logic '1\ Otherwise, it may be used as a generic control signal 
to enable or disable the interface. 

4.7.10 VMEBADBC Entity 

The VMEBADBC entity is the data multiplexor and register for the VMEbus data out (VD_0()) 
signal array. During read cycles, VMEbus output data is routed (using multiplexors) from the 
WISHBONE data input signal array [DAT JO] to the correct VMEbus output data signal array 
[VD_0()]. The routing depends upon the type of VMEbus data transaction (i.e. D32, D16(E) 
WORD or D 1 6(0) WORD). 
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4.8 VMEPCIBR Entity 

The VMEPCIBR entity is a top level, dual WISHBO^ 

Public domain WISHBONE shared bus with mult.plexor interconnections 

in several ways. These modifications include: 

• Re-encoded, variable address decoder. 

• • .l u- uoct 'dtt ' level of the svstem. A description of the system 

bridge. These are called the A SIDE and B SIDE ^ wlsHB ONE MASTER, 

the 'A SIDE' interconnect™ the VMEb u t SLAVE (Viv r |C0RE PCI core) is the 

^O^tT^^^ 
SLAVES (i.e. registers and memory buffers). 

coders are staples, (and ^ ^£^VES are located a, other ad- 

'powers-of-two (2, 4, 8, 16, etc.). Howeve • » Functionally, this is a standard ad- 

dresses, so a von* <fe«-*r '^d Junrt y, 

dress decoder except mKh SSto This will alow the system down 

is asserted whenever an error address is selected. 
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Figure 4-39. Modified public domain WISHBONE shared bus. 
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Figure 4-41. VMEbus (A SIDE') error decoder. 
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4.9 VMEPCIBR_SOC Entity 

As shown in Figure 4-42, the highest 

The entity has two functions: (1) to define te™&^*^£yo pad P s . with the excep- 
level RTL in the SoC) and (2) to ^specify ^^J^^^aL are located here, 
tion of the PCI I/O pads (which Xihnx locates in their PCI core), an i/u p 
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Figure 4-42(a\ VMEPCIBR SOC system hierarchy. 
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NOTES: 

(1) Add or substitute the files shown during pre-synthesis simulation. 

The top level test bench named ' VMETOPCI_SOCt .VHD' is used for both the pre-synthesis and 
post-routing top level simulation. Also note that the test bench uses (reads) a test 
vector file named 'VMETOPCI SOCv.TXT' . If the simulator can't find the file, it probably 
means that its path (as declared in 'VMETOPCI_SOCt.VHD' ) is wrong. 

The unmodified Xilinx PCI wrapper named T pcim_lc_33_3_s.VHD' is used for synthesis. 
Substitute file 'pcim_lc.vhd' for pre-synthesis simulation. 

The Xilinx PCI core black-box primative named ' pcim_lc_i . ngo ' is incorporated into the 
system by the Xilinx router. Substitute file 'pcim_lc_i .vhd' for pre-synthesis simulation. 

Special instructions when using Models im: 

(a) After creating the project, be sure to check the "Use 1993 Language Syntax" 
under Options » Compile. 

(b) Compile all files from their source directory. 

(2) During synthesis, the VHDL source file 'VMETOPCI_SOCc.VHD' is the top level entity. 
Route the system with modified Xilinx user constraint file named 2s200pq208_32_33_mod.ucf . 

(3) Contains block or distributed RAM primatives generated by Xilinx XST synthesis software. 

(4) Silicore VMEcore(tm). Also note that all VMEcore(tm) files are in the VMECORE directory. 

(5} Xilinx LogiCORE (tm) PCI file modified for this application. File is located under 
directory named 'PCIMODS'. 



VMEPCIBR SOC System Heirarchy 
27 SEP 2002 

Figure 4-420)1 VMEPCIBR SOC system hierarchy (con't). 
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4.10 VPWWRAP Entity 



VPWWRAP is a VMEbus posted write wrapper * the VME« ^TvZt^ 

Sr^SSSTK^U - read and write c y c,es (te- 
spectively). 



1.X WRITE 



INPUT STATE MACHINE 
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OTHER EQUATIONS: 

^ ^ Tjrv- + ctr T * ACK I DATA SOURCE READ CYCLE 
^iK*!^*/** DATA SOURCE WRITE CYCLE 



STB 0 = RDC + STB_I 
~ + WRC + STB_I ' 

CYC_0 = STB_0; 



DATA DESTIM READ CYCLE 



OUTPUT STATE MACHINE 

SIGNALS: 

CLOCK: CLK_I 

INPUTS: ACK_I, LOAD 

STATES: PWR k 

OTHER EQUATIONS: 

LOAD * STB I * WE_I * WRC * /PWR; 
MODE = /PWR"; 
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FEB 21, 2002 



v, r ^ Funct ;— i diagram of the VPWWRAP entity. 
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Figure 4-44. VPWWRAP read cycle. 
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Fi gurej45. y pwww A P write cycle. 
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Appendix A - GNU LESSER GENERAL PUBLIC LICENSE 

GNU LESSER GENERAL PUBLIC LICENSE - Version 2.1, February 1999 

Copyright (C) 1991, 1999 Free Software Foundation, Inc. 59 Temple Place, Suite 330, Boston, 
MA 021 1 1-1307 USA Everyone is permitted to copy and distribute verbatim copies of this li- 
cense document, but changing it is not allowed. 

[This is the first released version of the Lesser GPL. It also counts as the successor of the GNU 
Library Public License, version 2, hence the version number 2.1.] 

PREAMBLE 

The licenses for most software are designed to take away your freedom to share and change it. 
By contrast, the GNU General Public Licenses are intended to guarantee your freedom to share 
and change free software-to make sure the software is free for all its users. 

This license, the Lesser General Public License, applies to some specially designated software 
packages-typically libraries--of the Free Software Foundation and other authors who decide to 
use it. You can use it too, but we suggest you first think carefully about whether this license or 
the ordinary General Public License is the better strategy to use in any particular case, based on 
the explanations below. 

When we speak of free software, we are referring to freedom of use, not price. Our General Pub- 
lic Licenses are designed to make sure that you have the freedom to distribute copies of free 
software (and charge for this service if you wish); that you receive source code or can get it if 
you want it; that you can change the software and use pieces of 
it in new free programs; and that you are informed that you can do these things. 

To protect your rights, we need to make restrictions that forbid distributors to deny you these 
rights or to ask you to surrender these rights. These restrictions translate to certain responsibili- 
ties for you if you distribute copies of the library or if you modify it. 

For example, if you distribute copies of the library, whether gratis or for a fee, you must give the 
recipients all the rights that we gave you. You must make sure that they, too, receive or can get 
the source code. If you link other code with the library, you must provide complete object files 
to the recipients, so that they can relink them with the library after making changes to the library 
and recompiling it. And you must show them these terms so they know their rights. 

We protect your rights with a two-step method: (1) we copyright the library, and (2) we offer you 
this license, which gives you legal permission to copy, distribute and/or modify the library. 

To protect each distributor, we want to make it very clear that there is no warranty for the free 
library. Also, if the library is modified by someone else and passed on, the recipients should 
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GNU LESSER GENERAL PUBLIC LICENSE TERMS AND CONDITIONS FOR COPY- 
ING, DISTRIBUTION AND MODIFICATION 

0. This License Agreement applies to any software library or other program which contains a no- 
tice placed by the copyright holder or other authorized party saying it may be distributed under 
the terms of this Lesser General Public License (also called "this License"). Each licensee is ad- 
dressed as "you". 

A "library" means a collection of software functions and/or data prepared so as to be conven- 
iently linked with application programs (which use some of those functions and data) to form 
executables. 

The "Library", below, refers to any such software library or work which has been distributed un- 
der these terms. A "work based on the Library" means either the Library or any derivative work 
under copyright law: that is to say, a work containing the Library or a portion of it, either verba- 
tim or with modifications and/or translated straightforwardly into another language. (Hereinaf- 
ter, translation is included without limitation in the term "modification".) 

"Source code" for a work means the preferred form of the work for making modifications to it. 
For a library, complete source code means all the source code for all modules it contains, plus 
any associated interface definition files, plus the scripts used to control compilation and installa- 
tion of the library. 

Activities other than copying, distribution and modification are not covered by this License; they 
are outside its scope. The act of running a program using the Library is not restricted, and output 
from such a program is covered only if its contents constitute a work based on the Library (inde- 
pendent of the use of the Library in a tool for writing it). Whether that is true depends on what 
the Library does and what the program that uses the Library does. 

1 . You may copy and distribute verbatim copies of the Library's complete source code as you 
receive it, in any medium, provided that you conspicuously and appropriately publish on each 
copy an appropriate copyright notice and disclaimer of warranty; keep intact all the notices that 
refer to this License and to the absence of any warranty; and distribute a copy of this License 
along with the Library. 

You may charge a fee for the physical act of transferring a copy, and you may at your option of- 
fer warranty protection in exchange for a fee. 

2. You may modify your copy or copies of the Library or any portion of it, thus forming a work 
based on the Library, and copy and distribute such modifications or work under the terms of Sec- 
tion 1 above, provided that you also meet all of these conditions: 

a) The modified work must itself be a software library. 

b) You must cause the files modified to carry prominent notices stating that you changed 
the files and the date of any change. 
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c) You must cause the whole of the work to be licensed a, no charge to all third parties 
under the terms of this License. 

and performs whatever part of its purpose remains meaningful. 
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4. You may copy and distribute the Library (or a portion or derivative of it, under Section 2) in 
object code or executable form under the terms of Sections 1 and 2 above provided that you ac- 
company it with the complete corresponding machine-readable source code, which must be dis- 
tributed under the terms of Sections 1 and 2 above on a medium customarily used for software 
interchange. 

If distribution of object code is made by offering access to copy from a designated place, then 
offering equivalent access to copy the source code from the same place satisfies the requirement 
to distribute the source code, even though third parties are not compelled to copy the source 
along with the object code. 

5. A program that contains no derivative of any portion of the Library, but is designed to work 
with the Library by being compiled or linked with it, is called a "work that uses the Library". 
Such a work, in isolation, is not a derivative work of the Library, and therefore falls outside the 
scope of this License. 

However, linking a "work that uses the Library" with the Library creates an executable that is a 
derivative of the Library (because it contains portions of the Library), rather than a "work that 
uses the library". The executable is therefore covered by this License. Section 6 states terms for 
distribution of such executables. 

When a "work that uses the Library" uses material from a header file that is part of the Library, 
the object code for the work may be a derivative work of the Library even though the source 
code is not. Whether this is true is especially significant if the work can be linked without the 
Library, or if the work is itself a library. The threshold for this to be true is not precisely defined 
by law. 

If such an object file uses only numerical parameters, data structure layouts and accessors, and 
small macros and small inline functions (ten lines or less in length), then the use of the object file 
is unrestricted, regardless of whether it is legally a derivative work. (Executables containing this 
object code plus portions of the Library will still fall under Section 6.) 

Otherwise, if the work is a derivative of the Library, you may distribute the object code for the 
work under the terms of Section 6. Any executables containing that work also fall under Section 

6. whether or not they are linked directly with the Library itself. 

6. As an exception to the Sections above, you may also combine or link a "work that uses the Li- 
brary" with the Library to produce a work containing portions of the Library, and distribute that 
work under terms of your choice, provided that the terms permit modification of the work for the 
customer's own use and reverse engineering for debugging such modifications. 

You must give prominent notice with each copy of the work that the Library is used in it and that 
the Library and its use are covered by this License. You must supply a copy of this License. If 
the work during execution displays copyright notices, you must include the copyright notice for 
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a) Accompany the combined library with a copy of the same work based on the Library, 
uncombined with any other library facilities. This must be distributed under the terms 
of the Sections above. 

b) Give prominent notice with the combined library of the fact that part of it is a work 
based on the Library, and explaining where to find the accompanying uncombined form 
of the same work. 

8. You may not copy, modify, sublicense, link with, or distribute the Library except as expressly 
provided under this License. Any attempt otherwise to copy, modify, sublicense, link with, or 
distribute the Library is void, and will automatically terminate your rights under this License. 
However, parties who have received copies, or rights, from you under this License will not have 
their licenses terminated so long as such parties remain in full compliance. 

9. You are not required to accept this License, since you have not signed it. However, nothing 
else grants you permission to modify or distribute the Library or its derivative works. These ac- 
tions are prohibited by law if you do not accept this License. Therefore, by modifying or distrib- 
uting the Library (or any work based on the Library), you indicate your acceptance of this Li- 
cense to do so, and all its terms and conditions for copying, distributing or modifying the Library 
or works based on it. 

10. Each time you redistribute the Library (or any work based on the Library), the recipient 
automatically receives a license from the original licensor to copy, distribute, link with or modify 
the Library subject to these terms and conditions. You may not impose any further restrictions 
on the recipients 1 exercise of the rights granted herein. You are not responsible for enforcing 
compliance by third parties with this License. 

1 1. If, as a consequence of a court judgment or allegation of patent infringement or for any other 
reason (not limited to patent issues), conditions are imposed on you (whether by court order, 
agreement or otherwise) that contradict the conditions of this License, they do not excuse you 
from the conditions of this License. If you cannot distribute so as to satisfy simultaneously your 
obligations under this License and any other pertinent obligations, then as a consequence you 
may not distribute the Library at all. For example, if a patent license would not permit royalty- 
free redistribution of the Library by all those who receive copies directly or indirectly through 
you, then the only way you could satisfy both it and this License would be to refrain entirely 
from distribution of the Library. 

If any portion of this section is held invalid or unenforceable under any particular circumstance, 
the balance of the section is intended to apply, and the section as a whole is intended to apply in 
other circumstances. 

It is not the purpose of this section to induce you to infringe any patents or other property right 
claims or to contest validity of any such claims; this section has the sole purpose of protecting 
the integrity of the free software distribution system which is implemented by public license 
practices. Many people have made generous contributions to the wide range of software distrib- 
uted through that system in reliance on consistent application of that system; it is up to the au- 



117 



tbor/donor to decide if he or she is wining .0 distribute software through any other system and a 
licensee cannot impose that choice. 

This section is intended .0 make thoroughly clear what is believed to be a consequence of the 
rest of this License. 

a, o^^fth^Tihrarv is restricted in certain countries either by patents 

grates the limitation as if written in the body of thts Ucense. 
vetion but may differ in detail to address new problems or concerns. 

of this License which applies to it and any later version you J Software 

sion ever published by the Free Software Foundatton. 

14 . If you wish to incorporate parts of the Libr^ ^££*S^ F £3£ 

conditions are incompatible with these, wrtte to tire author to a* to P™*^ Foundation; 

which is copyrighted by the *■ «• «-h «* P- 

SSSlS ot I' fte'Hoftware arfc. of promoting .he sharing and 

reuse of software generally. 

NO WARRANTY 

15 . BECAUSE THE LIBRARY 

,« ,N NO EVENT UNLESS REQUIRED BY AMCABLE S LAW OR AGREED TOW 
WRITING WILL ANY COPYRIGHT HOLDER. R f ^o^, BE LIABLE 

MODIFY AND/OR REDISTRIBUTE ^HE^LIBRARY AS AB ^ C1DENTAL OR 

SSSoSS OSS OPHNABILITY TO USE THE 



118 



LIBRARY (INCLUDING BUT NOT LIMITED TO LOSS OF DATA OR DATA BEING 
RENDERED INACCURATE OR LOSSES SUSTAINED BY YOU OR THIRD PARTIES OR 
A FAILURE OF THE LIBRARY TO OPERATE WITH ANY OTHER SOFTWARE), EVEN 
IF SUCH HOLDER OR OTHER PARTY HAS BEEN ADVISED OF THE POSSIBILITY OF 
SUCH DAMAGES. 

END OF TERMS AND CONDITIONS 

HOW TO APPLY THESE TERMS TO YOUR NEW LIBRARIES 

If you develop a new library, and you want it to be of the greatest possible use to the public, we 
recommend making it free software that everyone can redistribute and change. You can do so by 
permitting redistribution under these terms (or, alternatively, under the terms of the ordinary 
General Public License). 

To apply these terms, attach the following notices to the library. It is safest to attach them to the 
start of each source file to most effectively convey the exclusion of warranty; and each file 
should have at least the "copyright" line and a pointer to where the full notice is found. 

<one line to give the library's name and a brief idea of what it does.> 
Copyright (C) <year> <name of author> 

This library is free software; you can redistribute it and/or modify it under the terms of 
the GNU Lesser General Public License as published by the Free Software Foundation; 
either version 2.1 of the License, or (at your option) any later version. 

This library is distributed in the hope that it will be useful, but WITHOUT ANY 
WARRANTY; without even the implied warranty of MERCHANTABILITY or FIT- 
NESS FOR A PARTICULAR PURPOSE. See the GNU Lesser General Public Li- 
cense for more details. 

You should have received a copy of the GNU Lesser General Public License along with 
this library; if not, write to the Free Software Foundation, Inc., 59 Temple Place, Suite 
330, Boston, MA 02111-1307 USA 

Also add information on how to contact you by electronic and paper mail. 

You should also get your employer (if you work as a programmer) or your school, if any, to sign 
a "copyright disclaimer" for the library, ifnecessary. Here is a sample; alter the names: 

Yoyodyne, Inc., hereby disclaims all copyright interest in the library Trob f (a library for tweak- 
ing knobs) written by James Random Hacker. 
<signature of Ty Coon>, 1 April 1990 
Ty Coon, President of Vice 

That's all there is to it! 
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Appendix C - GNU Free Documentation License 

GNU Free Documentation License - Version 1.2, November 2002 

r -„ht ir\ 7000 2001 2002 Free Software Foundation, Inc. 59 Temple Place Suite 330 

^ everyone is permitted to copy and distribute verbal cop.es of 
this license document, but changing it is not allowed. 



0. PREAMBLE 

being considered responsible for modifications made by others, 
a copyleft license designed for free software. 
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1 APPLICABILITY AND DEFINITIONS 

^IfTX'l atensTand isiTsic, as -you", "you accept the license if you 
cTpy modi *ordiitc A= work in a way requiring permission under copynght .aw. 

A "Modified Version" of the Document means any work containing the Document or a portion 
rf J eite tpied verbatim, or with modifications and/or translated into another ianguage. 

Z££SZ^2E£S ^e reiarionship could be a matter of historical connec- 
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tion with the subject or with related matters, or of legal, commercial, philosophical, ethical or 
political position regarding them. 

The "Invariant Sections" are certain Secondary Sections whose titles* are designated, as being 
those of Invariant Sections, in the notice that says that the Document is released under this Li- 
cense. If a section does not fit the above definition of Secondary then it is not allowed to be des- 
ignated as Invariant. The Document may contain zero Invariant Sections. If the Document does 
not identify any Invariant Sections then there are none. 

The "Cover Texts" are certain short passages of text that are listed, as Front-Cover Texts or 
Back-Cover Texts, in the notice that says that the Document is released under this License. A 
Front-Cover Text may be at most 5 words, and a Back-Cover Text may be at most 25 words. 

A "Transparent" copy of the Document means a machine-readable copy, represented in a format 
whose specification is available to the general public, that is suitable for revising the document 
straightforwardly with generic text editors or (for images composed of pixels) generic paint pro- 
grams or (for drawings) some widely available drawing editor, and that is suitable for input to 
text formatters or for automatic translation to a variety of formats suitable for input to text for- 
matters. A copy made in an otherwise Transparent file format whose markup, or absence of 
markup, has been arranged to thwart or discourage subsequent modification by readers is not 
Transparent. An image format is not Transparent if used for any substantial amount of text. A 
copy that is not "Transparent" is called "Opaque". 

Examples of suitable formats for Transparent copies include plain ASCII without markup, Tex- 
info input format, LaTeX input format, SGML or XML using a publicly available DTD, and 
standard-conforming simple HTML, PostScript or PDF designed for human modification. Ex- 
amples of transparent image formats include PNG, XCF and JPG. Opaque formats include pro- 
prietary formats that can be read and edited only by proprietary word processors, SGML or XML 
for which the DTD and/or processing tools are not generally available, and the machine- 
generated HTML, PostScript or PDF produced by some word processors for output purposes 
only. 

The "Title Page" means, for a printed book, the title page itself, plus such following pages as are 
needed to hold, legibly, the material this License requires to appear in the title page. For works 
in formats which do not have any title page as such, "Title Page" means the text near the most 
prominent appearance of the work's title, preceding the beginning of the body of the text. 

A section "Entitled XYZ" means a named subunit of the Document whose title either is precisely 
XYZ or contains XYZ in parentheses following text that translates XYZ in another language. 
(Here XYZ stands for a specific section name mentioned below, such as "Acknowledgements", 
"Dedications", "Endorsements", or "History".) To "Preserve the Title" of such a section when 
you modify the Document means that it remains a section "Entitled XYZ" according to this defi- 
nition. 

The Document may include Warranty Disclaimers next to the notice which states that this Li- 
cense applies to the Document. These Warranty Disclaimers are considered to be included by 
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2 VERBATIM COPYING 

daily, provided that this L.cense, the copyright nonces, ana t editions 
=vS^^ 

35 -* — — - 

must also follow the conditions in section 3. 

You may also lend eopies, under the same conditions stated above, and you may publicly display 
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3. COPYING IN QUANTITY 
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It is requested, but not required, that you contact the authors of the Document well before redis- 
tributing any large number of copies, to give them a chance to provide you with an updated ver- 
sion of the Document. 

4. MODIFICATIONS 

You may copy and distribute a Modified Version of the Document under the conditions of sec- 
tions 2 and 3 above, provided that you release the Modified Version under precisely this License, 
with the Modified Version filling the role of the Document, thus licensing distribution and modi- 
fication of the Modified Version to whoever possesses a copy of it. In addition, you must do 
these things in the Modified Version: 

A. Use in the Title Page (and on the covers, if any) a title distinct from that of the Docu- 
ment, and from those of previous versions (which should, if there were any, be listed in 
the History section of the Document). You may use the same title as a previous version 
if the original publisher of that version gives permission. 

B List on the Title Page, as authors, one or more persons or entities responsible for au- 
thorship of the modifications in the Modified Version, together with at least five of the 
principal authors of the Document (all of its principal authors, if it has fewer than five), 
unless they release you from this requirement. 

C. State on the Title page the name of the publisher of the Modified Version, as the pub- 
lisher. 

D. Preserve all the copyright notices of the Document. 

E. Add an appropriate copyright notice for your modifications adjacent to the other copy- 
right notices. 

F. Include, immediately after the copyright notices, a license notice giving the public per- 
mission to use the Modified Version under the terms of this License, in the form shown 
in the Addendum below. 

G. Preserve in that license notice the full lists of Invariant Sections and required Cover 
Texts given in the Document's license notice. 

H. Include an unaltered copy of this License. 

I. Preserve the section Entitled "History", Preserve its Title, and add to it an item stating 
at least the title, year, new authors, and publisher of the Modified Version as given on 
the Title Page. If there is no section Entitled "History" in the Document, create one 
stating the title, year, authors, and publisher of the Document as given on its Title Page, 
then add an item describing the Modified Version as stated in the previous sentence. 
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5. COMBINING DOCUMENTS 
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You may combine the Document with other documents released under this License, under the 
terms defined in section 4 above for modified versions, provided that you include in the combi- 
nation all of the Invariant Sections of all of the original documents, unmodified, and list them all 
as Invariant Sections of your combined work in its license notice, and that you preserve all their 
Warranty Disclaimers. 

The combined work need only contain one copy of this License, and multiple identical Invariant 
Sections may be replaced with a single copy. If there are multiple Invariant Sections with the 
same name but different contents, make the title of each such section unique by adding at the end 
of it, in parentheses, the name of the original author or publisher of that section if known, or else 
a unique number. Make the same adjustment to the section titles in the list of Invariant Sections 
in the license notice of the combined work. 

In the combination, you must combine any sections Entitled "History" in the various original 
documents, forming one section Entitled "History"; likewise combine any sections Entitled "Ac- 
knowledgements", and any sections Entitled "Dedications". You must delete all sections Entitled 
"Endorsements". 



6. COLLECTIONS OF DOCUMENTS 

You may make a collection consisting of the Document and other documents released under this 
License, and replace the individual copies of this License in the various documents with a single 
copy that is included in the collection, provided that you follow the rules of this License for ver- 
batim copying of each of the documents in all other respects. 

You may extract a single document from such a collection, and distribute it individually under 
this License, provided you insert a copy of this License into the extracted document, and follow 
this License in all other respects regarding verbatim copying of that document. 

7. AGGREGATION WITH INDEPENDENT WORKS 

A compilation of the Document or its derivatives with other separate and independent documents 
or works, in or on a volume of a storage or distribution medium, is called an "aggregate" if the 
copyright resulting from the compilation is not used to limit the legal rights of the compilation's 
users beyond what the individual works permit. When the Document is included in an aggregate, 
this License does not apply to the other works in the aggregate which are not themselves deriva- 
tive works of the Document. 

If the Cover Text requirement of section 3 is applicable to these copies of the Document, then if 
the Document is less than one half of the entire aggregate, the Document's Cover Texts may be 
placed on covers that bracket the Document within the aggregate, or the electronic equivalent of 
covers if the Document is in electronic form. Otherwise they must appear on printed covers that 
bracket the whole aggregate. 
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8. TRANSLATION 

Transition is considered a Kind of modification, so ^ ^^^It^s^ 
men, nnder «he terms of secton * Replacing nvanant &£. «h — ^ 
permission from their copyright no ders, bu you may '" c y indude a 

ant Sections in addition to the original versions of these Invariant Sections. 

™n^,^^ 

claimer, the original version will prevail. 



title. 

9. TERMINATION 



a;k, chiirense or distribute the Document except as expressly provided 
You may not copy, modify, sublicense, or distnouie u ... or distr }bute the Docu- 

10. FUTURE REVISIONS OF THIS LICENSE 

Th e Free Software Foundation may publish new re^edv ersions ^^^^Z 
'jm L ard~H^ 

Each version of the License is given a ^Xt^X^t^ 
that a particular numbered version of this License or any ater version app , . y 

the optic of following the terms and ^°^'tf^t^t^ « *• Di- 
version that has been published %£?£2?SZ any version ever pub- 
ment does not specify a version number of this License, you n y 
lished (not as a draft) by the Free Software Foundation. 



ADDENDUM: How to use this License for your documents 

Tn use this License in a document you have written, include a copy of the Lice 
men, and put me following copyright and license notices jus, after me „,le page. 
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Copyright (c) YEAR YOUR NAME. 

Permission is granted to copy, distribute and/or modify this document 

under the terms of the GNU Free Documentation License, Version 1 .2 

or any later version published by the Free Software Foundation; 

with no Invariant Sections, no Front-Cover Texts, and no Back-Cover Texts. 

A copy of the license is included in the section entitled "GNU 

Free Documentation License". 

If you have Invariant Sections, Front-Cover Texts and Back-Cover Texts, replace the 
"with.. .Texts/' line with this: 

with the Invariant Sections being LIST THEIR TITLES, with the 
Front-Cover Texts being LIST, and with the Back-Cover Texts being LIST. 

If you have Invariant Sections without Cover Texts, or some other combination of the three, 
merge those two alternatives to suit the situation. 

If your document contains nontrivial examples of program code, we recommend releasing these 
examples in parallel under your choice of free software license, such as the GNU General Public 
License, to permit their use in free software. 
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Chapter 1 - Introduction 



The WISHBONE 1 System-on-Chip (SoC) Interconnection Architecture for Portable IP Cores is a 
flexible design methodology for use with semiconductor IP cores. Its purpose is to foster design 
reuse by alleviating System-on-Chip integration problems. This is accomplished by creating a 
common interface between IP cores. This improves the portability and reliability of the system, 
and results in faster time-to-market for the end user. 

Previously, IP cores used non-standard interconnection schemes that made them difficult to inte- 
grate. This required the creation of custom glue logic to connect each of the cores together. By 
adopting a standard interconnection scheme, the cores can be integrated more quickly and easily 
by the end user. 

This specification can be used for soft core, firm core or hard core IP. Since firm and hard cores 
are generally conceived as soft cores, the specification is written from that standpoint. 

This specification does not require the use of specific development tools or target hardware. 
Furthermore, it is fully compliant with virtually all logic synthesis tools. However, the examples 
presented in the specification do use the VHDL hardware description language. These are pre- 
sented only as a convenience to the reader, and should be readily understood by users of other 
hardware description languages (such as Verilog®). Schematic based tools can also be used. 

The WISHBONE interconnect is intended as a general purpose interface. As such, it defines the 
standard data exchange between IP core modules. It does not attempt to regulate the application- 
specific functions of the IP core. 

The WISHBONE architects were strongly influenced by three factors. First, there was a need for 
a good, reliable System-on-Chip integration solution. Second, there was a need for a common 
interface specification to facilitate structured design methodologies on large project teams. 
Third, they were impressed by the traditional system integration solutions afforded by micro- 
computer buses such as PCI bus and VMEbus. 

In fact, the WISHBONE architecture is analogous to a microcomputer bus in that that they both: 
(a) offer a flexible integration solution that can be easily tailored to a specific application; (b) 
offer a variety of bus cycles and data path widths to solve various system problems; and (c) al- 
low products to be designed by a variety of suppliers (thereby driving down price while improv- 
ing performance and quality). 



1 Webster's dictionary defines a WISHBONE as "the forked clavicle in front of the breastbone of most birds." The 
term 'WISHBONE interconnect' was coined by Wade Peterson of Silicore Corporation. During the initial definition 
of the scheme he was attempting to find a name that was descriptive of a bi-directional data bus that used either 
multiplexors or three-state logic. This was solved by forming an interface with separate input and output paths. 
When these paths are connected to three-state logic it forms a 'Y' shaped configuration that resembles a wishbone. 
The actual name was conceived during a Thanksgiving Day dinner that included roast turkey. Thanksgiving Day is 
a national holiday in the United States, and is observed on the third Thursday in November. It is generally cele- 
brated with a traditional turkey dinner. 



WISHBONE SoC Architecture Specification, Revision B.2 



7 



Tctn " sources. These do no. exist in mieroeomputer buses beeaase they are l,m,«ed by 1C 
packaging and mechanical connectors. 

Th* WKHRONE architects have attempted to create a specification that is robust enough to in- 
goals have been accomplished with the publication of this document. 
1.1 WISHBONE Features 

The WISHBONE interconnection makes System-on-Chip and design reuse easy by creating a 
standard data exchange protocol. Features of this technology include: 

. Simple, compact, logical IP core hardware interfaces that require very few logic gates. 
. Supports structured design methodologies used by large project teams. 

• Full set of popular data transfer bus protocols including: 

- READ/WRITE cycle 

- BLOCK transfer cycle 

- RMW cycle 

• Data bus widths 2 and operand sizes up to 64-bits. 

. Supports both BIG END1AN and LITTLE END1AN data ordering. 

. Variable core interconnection methods support point-to-point, shared bus, crossbar 
switch, and switched fabric interconnections. 

. Handshaking protocol allows each IP core to throttle its data transfer speed. 

• Supports single clock data transfers. 

. Supports normal cycle termination, retry termination and termination due to error. 

• Address widths 3 up to 64-bits. 

ally used in FIFO interfaces). 

8 
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o Partial address decoding scheme for SLAVEs. This facilitates high speed address de- 
coding, uses less redundant logic and supports variable address sizing and interconnec- 
tion means. 

o User-defined tag support. This is useful for identifying transfers such as: 

- Data transfers 

- Interrupt vectors 

- Cache control operations 

o MASTER / SLAVE architecture for very flexible system designs. 

o Multiprocessing (multi-MASTER) capabilities. This allows for a wide variety of Sys- 
tem-on-Chip configurations. 

o Arbitration methodology is defined by the end user (priority arbiter, round-robin arbi- 
ter, etc.). 

o Supports various IP core interconnection means, including: 

- Unidirectional bus 

- Bi-directional bus 

- Multiplexor based interconnections 

- Three-state based interconnections 

- Off chip I/O 

o Synchronous design assures portability, simplicity and ease of use. 
o Very simple, variable timing specification. 

o Documentation requirements allow the end user to quickly evaluate interface needs. 

o Independent of hardware technology (FPGA, ASIC, etc.). 

o Independent of delivery method (soft, firm or hard core). 

o Independent of synthesis tool, router and layout tool technology. 

o Independent of FPGA and ASIC test methodologies. 

o Seamless design progression between FPGA prototypes and ASIC production chips. 



3 Specifications are given for address widths between zero and 64-bits. However, the basic architecture can theo- 
retically support any address width. 
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1.2 WISHBONE Objectives 

The main objective of the specification is to create a flexible interconnection means for use with 
Iemi3ucto;Tcores. This allows various IP core modules to be connected together to form a 
System-on-Chip. 

A further objective of the specification is to enforce good compatibility between IP core mod- 
ules. This enhances design reuse. 

A further objective of the specification is to create a robust standard, but one that does not un- 
duly constrain the creativity of the core developer or the end user. 

A further objective of the specification is to make it easy to understand by both the core devel- 
oper and the end user. 

A further objective of the specification is to facilitate structured design methodologies on large 
pro?* am ^ structured design, individual team members can build and test smal parts of 
Kesian Each member of the design team can interface to the common, well-defined WISH- 
SSS^iSSr^i all of the" sub-assemblies have been completed, the lull system can 
be integrated. 

A further objective of the specification is create a portable interface that i ^dependent t of "the 
underlySg Semiconductor technology. For example, the interconnect must be capable of work- 
ing with both FPGA and ASIC hardware target devices. 

A further objective of the specification is to make the interface independent of logic signaling 
levels. 

A further objective of the specification is to create a flexible int e™™*^*™^ 
pendent of the IP core delivery method. For example, it may be used with soft core , firm core 
or 'hard core' delivery methods. 

A further objective of the specification is to be independent of the underlying hardware descrip- 
tion lor example, soft cores may be written and synthesized in VHDL, Verilog® or some other 
hardware description language. Schematic entry may also be used. 

A further objective of the specification is to require a minimum standard for documentation. 
This allows IP core users to quickly evaluate and integrate new cores. 

A further objective of the specification is to eliminate extensive interface document^ ion on the 
part of the IP core developer. In most cases, this specification along with the WISHBONE DA 
1ASHEET is sufficient to completely document an IP core data interface. 

A further objective of the specification is to identify critical ^-^^^^ 
technologies, and to place them into the public domam at the earliest possible date. This makes 
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it more difficult for individuals and organizations to create proprietary SoC technologies through 
the use of patent, trademark, copyright and trade secret protection mechanisms. This objective 
applies only to the interconnection of IP cores, but not to the IP cores themselves. 

A further objective is to create an architecture that has a smooth transition path to support new 
technologies. This increases the longevity of the specification as it can adapt to new, and as yet 
un-thought-of, requirements. 

A further objective is to create an architecture that allows various interconnection means be- 
tween IP core modules. This insures that the end user can tailor the System-on-Chip to his/her 
own needs. For example, the entire interconnection system (which is analogous to a backplane 
on a standard microcomputer bus like VMEbus or cPCI) can be created by the system integrator. 
This allows the interconnection to be tailored to the final target device. 

A further objective is to create an architecture that requires a minimum of glue logic. In some 
cases the System-on-Chip needs no glue logic whatsoever. However, in other cases the end user 
may choose to use a more sophisticated interconnection method (for example with FIFO memo- 
ries or crossbar switches) that requires additional glue logic. 

A further objective is to create an architecture with variable address and data path widths to meet 
a wide variety of system requirements. 

A further objective is to create an architecture that fully supports the automatic generation of in- 
terconnection systems. This allows the WISHBONE interconnection to be generated with para- 
metric core generators. 

A further objective is to create an architecture that supports both BIG ENDIAN and LITTLE 
ENDIAN data transfer organizations. 

A further objective is to create an architecture that supports one data transfer per clock cycle. 

A further objective is to create a flexible architecture that allows data to be tagged. TAGs are 
user defined signals that allow each IP core to communicate with the rest of the system. They 
are especially useful when novel or unusual control signals (such as parity, cache control or in- 
terrupt acknowledge) are needed on an interface. 

A farther objective is to create an architecture with a MASTER/SLAVE topology. Furthermore, 
the system must be capable of supporting multiple MASTERS and multiple SLAVEs with an ef- 
ficient arbitration mechanism. 

A further objective is to create an architecture that supports point-to-point interconnections be- 
tween IP cores. 

A further objective is to create an architecture that supports shared bus interconnections between 
IP cores. 
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A further objective is to create an architecture that supports crossbar switches between IP cores. 
A further objective is to create an architecture that supports switched fabrics. 

A further objective is to create a synchronous protocol to insure ease of use, good reliability and 
easy testing. Furthermore, all transactions can be coordinated by a single clock. 

A further objective is to create a synchronous protocol that works over a wide range of interface 
clock speeds. The effects of this are: (a) that the WISHBONE interface can work synchronously 
with all attached IP cores, (b) that the interface can be used on a large range of target devices, (c) 
that the timing specification is much simpler and (d) that the resulting semiconductor device is 
much more testable. 

A further objective is to create a variable timing mechanism whereby the system clock frequency 
can be adjusted so as to control the power consumption of the integrated circuit. 

A further objective is to create a synchronous protocol that provides a simple timing specifica- 
tion. This makes the interface very easy to integrate. 

A further objective is to create a synchronous protocol where each MASTER and SLAVE can 
throttle the data transfer rate with a handshaking mechanism. 

A further objective is to create a synchronous protocol that is optimized for System-on-Chip, but 
that is also suitable for off-chip I/O routing. Generally, the off-chip WISHBONE interconnect 
will operate at slower speeds. 



1.3 Specification Terminology 

To avoid confusion, and to clarify the requirements for compliance, this specification makes use 
of five keywords to define the operation of the WISHBONE interconnect. The keywords are: 



• RULE 

• RECOMMENDATION 

• SUGGESTION 

• PERMISSION 

• OBSERVATION 

Any text not labeled with one of these keywords describes the operation in a narrative style, 
keywords are defined as follows: 



RULE 

Rules form the basic framework of the specification. They are sometimes expressed in text form 
and sometimes in the form of figures, tables or drawings. All rules MUST be followed to ensure 
compatibility between interfaces. Rules are characterized by an imperative style. The upper- 
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case words MUST and MUST NOT are reserved exclusively for stating rules in this document, 
and are not used for any other purpose. 

RECOMMENDATION 

Whenever a recommendation appears, designers would be wise to take the advice given. Doing 
otherwise might result in some awkward problems or poor performance. While this specification 
has been designed to support high performance systems, it is possible to create an interconnec- 
tion that complies with all the rules, but has very poor performance. In many cases a designer 
needs a certain level of experience with the system architecture in order to design interfaces that 
deliver top performance. Recommendations found in this document are based on this kind of 
experience and are provided as guidance for the user. 

SUGGESTION 

A suggestion contains advice which is helpful but not vital. The reader is encouraged to consider 
the advice before discarding it. Some design decisions are difficult until experience has been 
gained. Suggestions help a designer who has not yet gained this experience. Some suggestions 
have to do with designing compatible interconnections, or with making system integration easier. 

PERMISSION 

In some cases a rule does not specifically prohibit a certain design approach, but the reader might 
be left wondering whether that approach might violate the spirit of the rule, or whether it might 
lead to some subtle problem. Permissions reassure the reader that a certain approach is accept- 
able and will not cause problems. The upper-case word MAY is reserved exclusively for stating 
a permission and is not used for any other purpose. 

OBSERVATION 

Observations do not offer any specific advice. They usually clarify what has just been discussed. 
They spell out the implications of certain rules and bring attention to things that might otherwise 
be overlooked. They also give the rationale behind certain rules, so that the reader understands 
why the rule must be followed. 



1.4 Use of Timing Diagrams 

Figure 1-1 shows some of the key features of the timing diagrams in this specification. Unless 
otherwise noted, the MASTER signal names are referenced in the timing diagrams. In some 
cases the MASTER and SLAVE signal names are different. For example, in the single MAS- 
TER / single SLAVE configuration, the [ADR_0] and [ADR I] signals are connected together. 
Furthermore, the actual waveforms at the SLAVE may vary from those at the MASTER. That's 
because the MASTER and SLAVE interfaces can be connected together in different ways. Un- 
less otherwise noted, the timing diagrams refer to the connection diagram shown in Figure 1-2. 
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Figure 1-1 . Use of tim in g diagrams. 
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Individual signals may or may not be present on an specific interface. That's because many of 
the signals are optional. 

Two symbols are also presented in relation to the [CLK_I] signal. These include the positive 
going clock edge transition point and the clock edge number. In most diagrams a vertical guide- 
line is shown at the positive-going edge of each [CLKJ] transition. This represents the theoreti- 
cal transition point at which flip-flops register their input value, and transfer it to their output. 
The exact level of this transition point varies depending upon the technology used in the target 
device. The clock edge number is included as a convenience so that specific points in the timing 
diagram may be referenced in the text. The clock edge number in one timing diagram is not re- 
lated to the clock edge number in another diagram. 

Gaps in the timing waveforms may be shown. These indicate either: (a) a wait state or (b) a por- 
tion of the waveform that is not of interest (in the context of the diagram). When the gap indi- 
cates a wait state, the symbols '-WSM-' or '-WSS-' are placed in the gap along the [CLKJ] 
waveform. These correspond to wait states inserted by the MASTER or SLAVE interfaces re- 
spectively. They also indicate that the signals (with the exception of clock transitions and 
hatched regions) will remain in a steady state during that time. 

Undefined signal levels are indicated by a hatched region. This region indicates that the signal 
level is undefined, and may take any state. It also indicates that the current state is undefined, 
and should not be relied upon. When signal arrays are used, stable and predictable signal levels 
are indicated with the word 'VALID'. 



1.5 Signal Naming Conventions 

All signal names used in this specification have the ' V or '0' characters attached to them. 
These indicate if the signals are an input (to the core) or an output (from the core). For example, 
[ACKJ] is an input and [ACKJD] is an output. This convention is used to clearly identify the 
direction of each signal. 

In some cases, the input and output characters T and 'O' may be omitted and replaced by an 
4 X\ For example: [TAG3_X]. This is not an actual signal name, but rather a shorthand form to 
indicate both the [TAG3_I] and [TAG3J3] signal. 

Signal arrays are identified by a name followed by the array boundaries in parenthesis. For ex- 
ample, [DAT_I(63..0)] is a signal array with upper array boundary number sixty-three, and lower 
array boundary number zero. Furthermore, the array boundaries indicate the full range of the 
permissible array size. The array size on any particular core may vary. In many cases the array 
boundaries are omitted if they are irrelevant to the context of the description. 

When used as part of a sentence, signal names are enclosed in brackets 6 [ ]\ This helps to dis- 
criminate signal names from the words in the sentence. 
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1.6 WISHBONE Logo 

The WISHBONE logo can be affixed to SoC documents that are compatible with this standard. 
Figure 1-3 shows the logo. 




Figure 1-3. WISHBONE logo. 



Do™ntede^ibfng a WISHBONE compatible SoC component that are 100% compliant with 
this standard, MAY use the WISHBONE logo. 



1.7 Glossary of Terms 

Th^O^prefix^nto a hexadecimal number. It is the same nomenclature as commonly used 
in the 'C programming language. 

Active High Logic State . , ... 

A logic stote that is 'true' when the logic level is a binary '1'. The high state is at a higher volt- 
age than the low state. 

Active Low Logic State , , . 

A logic state that is 'true' when the logic level is a binary '0'. The low state is at a lower voltage 

than the high state. 
ASIC 

Acronym for: Application Specific Integrated Circuit. General term which describes a generic 
array of logic gates or analog building blocks which are programmed by a totalization layer at a 
silicon foundry. High level circuit descriptions are impressed upon the logic gates or analog 
building blocks in the form of metal interconnects. 

(1) A verb indicating that a logic state has switched from the inactive to the active state. When 
active high logic is used it means that a signal has switched from a logic low level to a logic high 
level. (2) Assert: to cause a signal line to make a transition from its logically false (inactive) 
state to its logically true (active) state. Opposite of negated. 



WISHBONE SoC Architecture Specification, Revision B.2 



16 



Bus 

(1) A common group of signals. (2) A signal line or a set of lines used by a data transfer system 
to connect a number of devices. 

Bus Interface 

An electronic circuit that drives or receives data or power from a bus. 
Bus Cycle 

The process whereby digital signals effect the transfer of data across a bus by means of an inter- 
locked sequence of control signals. Also see: Phase (bus cycle). 

Crossbar Interconnect (Switch) 

Crossbar switches are mechanisms that allow modules to connect and communicate. Each con- 
nection channel can be operated in parallel to other connection channels. This increases the data 
transfer rate of the entire system by employing parallelism. Stated another way, two 100 
Mbyte/second channels can operate in parallel, thereby providing a 200 Mbyte/second transfer 
rate. This makes the crossbar switches inherently faster than traditional bus schemes. Crossbar 
routing mechanisms generally support dynamic configuration. This creates a configurable and 
reliable network system. Most crossbar architectures are also scalable, meaning that families of 
crossbars can be added as the needs arise. A crossbar interconnection is shown in Figure 1-4. 
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Figure 1-4. Crossbar (switch) interconnection. 



Data Flow Interconnection 

An interconnection where data flows through a prearranged set of IP cores in a sequential order. 
Data flow architectures often have the advantage of parallelism, whereby two or more functions 
are executed at the same time. Figure 1-5 shows a data flow interconnection between IP cores. 



WISHBONE SoC Architecture Specification, Revision B.2 



17 




DIRECTION OF DATA FLOW 



Figure 1-5. Data flow interconnection. 



Data Organization 

The ordering of data during a transfer. Generally, 8-bit (byte) data can be stored with the most 
significant byte of a mult-byte transfer at the higher or the lower address. These two methods are 
generally called BIG ENDIAN and LITTLE ENDIAN, respectively. In general, BIG ENDIAN 
refers to byte lane ordering where the most significant byte is stored at the lower address. LIT- 
TLE ENDIAN refers to byte lane ordering where the most significant byte is stored at the higher 
address. The terms BIG ENDIAN and LITTLE ENDIAN for data organization was coined by 
Danny Cohen of the Information Sciences Institute, and was derived from the book Gulliver's 
Travels by Jonathan Swift. 

DMA Unit , _ . . . 

Acronym for Direct Memory Access Unit. (1) A device that transfers data from one location in 
memory to another location in memory. (2) A device for transferring data between a device and 
memory without interrupting program flow. (3) A device that does not use low-level instruc- 
tions. Intended for transferring data between memory or I/O locations. 

ENDIAN 

See the definition under 'Data Organization'. 
FIFO 

Acronym for: First In First Out. A type of memory used to transfer data between ports on two 
devices. In FIFO memories, data is removed in the same order that they were added. The FIFO 
memory is very useful for interconnecting cores of differing speeds. 



Firm Core . , . . 

An IP Core that is delivered in a way that allows conversion into an integrated circuit design, but 
does not allow the design to be easily reverse engineered. It is analogous to a binary or object 
file in the field of computer software design. 
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Fixed Interconnection 

An interconnection system that is fixed, and cannot be changed without causing incompatibilities 
between bus modules (or SoC/IP cores). Also called a static interconnection. Examples of fixed 
interconnection buses include PCI, cPCI and VMEbus. Also see: variable interconnection. 

Fixed Timing Specification 

A timing specification that is based upon a fixed set of rules. Generally used in traditional mi- 
crocomputer buses like PCI and VMEbus. Each bus module must conform to the ridged set of 
timing specifications. Also see: variable timing specification. 

Foundry 

See silicon foundry. 
FPGA 

Acronym for: Field Programmable Gate Array. Describes a generic array of logical gates and 
interconnect paths which are programmed by the end user. High level logic descriptions are im- 
pressed upon the gates and interconnect paths, often in the form of IP Cores. 

Full Address Decoding 

A method of address decoding where each SLAVE decodes all of the available address space. 
For example, if a 32-bit address bus is used, then each SLAVE decodes all thirty-two address 
bits. This technique is used on standard microcomputer buses like PCI and VMEbus. Also see: 
partial address decoding. 

Gated Clock 

A type of SYSCON interface where clock signal [CLK_0] can be stopped and restarted. The 
signal is always stopped in its low state. This technique is often used to reduce the power con- 
sumption of an integrated circuit. Under WISHBONE, the gated clock generator is optional. 
Also see: variable clock generator. 

Glue Logic 

(1) Logic gates and interconnections required to connect IP cores together. The requirements for 
glue logic vary greatly depending upon the interface requirements of the IP cores. (2) A family 
of logic circuits consisting of various gates and simple logic elements, each of which serve as an 
interface between various parts of a computer system. 

Granularity 

The smallest unit of data transfer that a port is capable of transferring. For example, a 32-bit port 
can be broken up into four 8-bit BYTE segments. In this case, the granularity of the interface is 
8-bits. Also see: port size and operand size. 

Hard Core 

An IP Core that is delivered in the form of a mask set (i.e. a graphical description of the features 
and connections in an integrated circuit). 
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Hardware Description Language (HDL) v^rilnota m 

(1) Acronym for: Hardware Description Language. Examples include VHDL and Venlog®. (2) 
A general-purpose language used for the design of digital electronic systems. 

A combTnation of signals and data-ports on a module which are capable of either generating re- 
ceiving, controlling^ interconnecting IP cores. WISHBONE 

SLAVE, INTERCON and SYSCON interfaces respectively. Also see. MASTER, SLA vh, w 
TERCON and SYSCON. 

A WISHBONE interface that interconnects MASTER, SLAVE and SYSCON interfaces. 
Acronym for: Intellectual Property Core. Also see: soft core, firm core and hard core. 
A graphical description of the features and connections in an integrated circuit. 

^ISHBONE interface that is capable of generating bus cycles. All systems based ^on fte 
WISHBONE interconnect must have at least one MASTER interface. Also see. SLAVE, 
SYSCON and INTERCON. 

SSatSL ,0 be s,ored and recaUed in money a, individual binary ad- 
dresses. 

fS^^^^n^ or other software development tools remove unused 
This isTmportant in WISHBONE because there are optional signals defined on many of 
ftf mterfaces 7a signal is unused, then the logic minimization tools will remove these signals 
and their associated logic, thereby making a faster and more efficient design. 

Module m 

In the context of this specification, it's another name for an IP core. 

r"^«Sa™..ip.exo re to route address, data and conrrol signars. Often used 
for System-on-Chip (SoC) applications. Also see: three-state bus Mercotmectton. 



A^erbtndicating that a logic state has switched from the active to the inactive state. When ac- 
tivctgltgtc "used i. means that a signal has switched from a logic high level to a logtc low 
level. Also see: asserted. 
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Off-Chip Interconnection 

An off-chip interconnection is used when a WISHBONE interface extends off-chip. See Figure 
1-6. 
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Figure 1-6. Off-chip interconnection. 



Operand Size 

The operand size is the largest single unit of data that is moved through an interface. For exam- 
ple, a 32-bit DWORD operand can be moved through an 8-bit port with four data transfers. Also 
see: granularity and port size. 

Parametric Core Generator 

A software tool used for the generation of IP cores based on input parameters. One example of a 
parametric core generator is a DSP filter generator. These are programs that create lowpass, 
bandpass and highpass DSP filters. The parameters for the filter are provided by the user, which 
causes the program to produce the digital filter as a VHDL or Verilog® hardware description. 
Parametric core generators can also be used create WISHBONE interconnections. 

Partial Address Decoding 

A method of address decoding where each SLAVE decodes only the range of addresses that it 
requires. For example, if the module needs only four addresses, then it decodes only the two 
least significant address bits. The remaining address bits are decoded by the interconnection 
system. This technique is used on SoC buses, and has the advantages of: less redundant logic in 
the system, it supports variable address buses, it supports variable interconnection buses and is 
relatively fast. Also see: full address decoding. 

PCI 

Acronym for: Peripheral Component Interconnect. Generally used as an interconnection scheme 
between integrated circuits. It also exists as a board level interconnection known as Compact 
PCI (or cPCI). While this specification is very flexible, it isn't practical for SoC applications. 

Phase (Bus Cycle) 

A periodic portion of a bus cycle. For example, a WISHBONE BLOCK READ cycle could 
contain ten phases, with each phase transferring a single 32-bit word of data. Collectively, the 
ten phases form the BLOCK READ cycle. 
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0/t! "iSSSSnS *- SUPP0«S a single W 1S HBONE MASTER and a sing 
WISHBONE SLAVE interface. It is the simplest way to connect two cores. See Figure w 
A connection with only two endpoints. 
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Figure 1-7. Point to point interconnection. 



Port Size 

The width of the WISHBONE data ports in bits. Also see: granularity and operand size. 

Software tool that physically routes interconnection paths between logic gates. Applies to both 
FPGA and ASIC devices. 



(1) Register-transfer logic. A design methodology that moves data between registers. Data is 
atched in the registers at one or more stages along the path of signal propagation The WISH- 
BONE specification uses a synchronous RTL design methodology that requires that each register 
be clocked with a common clock. (2) Register-transfer level. A description of computer opera- 
tions where data transfers from register to register, latch to latch and through logic gates. (3) A 
level of description of a digital design in which the clocked behavior of the design is expressly 
described in terms of data transfers between storage elements, which may be implied and com- 
binational logic, which may represent any computing or arithmetic-logic-unit logic. RTL mod- 
eling allows design hierarchy that represents a structural description of other RTL models. 

Shared Bus Interconnection , 

The shared bus interconnection is a system where a MASTER initiates addressable bus cycles to 
a target SLAVE. Traditional buses such as VMEbus and PCI bus use this type of interconnec- 
tion As a consequence of this architecture, only one MASTER at a time can use the intercon- 
nexion resource (i.e. bus). Figure 1-8 shows an example of a WISHBONE shared bus inter- 
connection. 
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Figure 1-8. Shared bus interconnection. 



WISHBONE SoC Architecture Specification, Revision B. 



22 



Silicon Foundry 

A factory that produces integrated circuits. 
SLAVE 

A WISHBONE interface that is capable of receiving bus cycles. All systems based on the 
WISHBONE interconnect must have at least one SLAVE. Also see: MASTER, SYSCON and 
INTERCOM 

Soft Core 

An IP Core that is delivered in the form of a hardware description language or schematic dia- 
gram. 

SoC 

Acronym for System-on-Chip. Also see: System-on-Chip. 
Structured Design 

(1) A popular method for managing complex projects. Often used with large project teams. 
When structured design practices are used, individual team members build and test small parts of 
the design with a common set of tools. Each sub-assembly is designed to a common standard. 
When all of the sub-assemblies have been completed, the full system can be integrated and 
tested. This approach makes it much easier to manage the design process. (2) Any disciplined 
approach to design that adheres to specified rules based on principles such as modularity and 
top-down design. 

Switched Fabric Interconnection 

A type of interconnection that uses large numbers of crossbar switches. These are organized into 
arrays that resemble the threads in a fabric. The resulting system is a network of redundant in- 
terconnections. 

SYSCON 

A WISHBONE module and interface. The SYSCON module is the only module in the design 
which may contain the SYSCON interface. The SYSCON interface is the only interface in the 
design which may drive the system clock [CLK_0] and reset [RSTJ3] signals. 

System-on-Chip (SoC) 

A method by which whole systems are created on a single integrated circuit chip. In many cases, 
this requires the use of IP cores which have been designed by multiple IP core providers. Sys- 
tem-on-Chip is similar to traditional microcomputer bus systems whereby the individual compo- 
nents are designed, tested and built separately. The components are then integrated to form a 
finished system. 

Target Device 

The semiconductor type (or technology) onto which the IP core design is impressed. Typical 
examples include FPGA and ASIC target devices. 
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Three-State Bus Interconnection 

A microcomputer bus interconnection that relies upon three-state bus drivers. Often used I tc > re- 
duce the number of interconnecting signal paths through connector and C pins. Three state 
buffers can assume a logic low state ('0' or <L'), a logic high state (T or H ) or a high imped- 
ance state Three-state buffers are sometimes called Tri-State® buffers. Tri-State® is a reg.s- 
tered trademark of National Semiconductor Corporation. Also see: multiplexor interconnection. 

I a type of^vfcONTnterface where the frequency of [CLK_0] can be changed dynamically. 
The frequency can be changed by way of a programmable phase-lock-loop (PLL) circuit or other 
control mechanism. This technique is used to reduce the power consumption of the circuit, i ne 
variable clock generator capability is optional. Also see: gated clock generator and variable 
timing specification. 

Variable Interconnection *:u:i:*;«o 
A microcomputer bus interconnection that can be changed without causing incompatibilities 
between bus modules (or SoC/IP cores). Also called a dynamic interconnection. An example ot 
a variable interconnection bus is the WISHBONE SoC architecture. Also set: fixed mtercon- 
nection. 

l^^STno, fixed, in WISHBONE, variable timing can be achieved in a 
miS of ways. For example, the system integrator can select the SYSCON frequency rate of 
1CLK 01 by enforcing a timing specification during the circuit design. Variable timing can also 
U achieved during circuit operation with a variable clock generator. Also see: gated clock gen- 
erator and variable clock generator. 

AtexS based hardware description language (HDL) intended for use in circuit design. The 
Verilog® language is both a synthesis and a simulation tool. Verilog® was originally a propne- 
tar^ language first conceived in 1983 at Gateway Design Automation (Acton MA), and was later 
refined by Cadence Corporation. It has since been greatly expanded and refined, and much of it 
has been placed into the public domain. Complete descriptions of the language can be found in 
the IEEE 1364 specification. 

I?ron L ym for: VHSIC Hardware Description Language. [VHSIC: Very High Speed Integrated 
Circuit]. A textual based computer language intended for use in circuit design. The VHDL lan- 
guage is both a synthesis and a simulation tool. Early forms of the language emerged from US 
Dept. of Defense ARPA projects in the 1960's, and have since been greatly expanded and re- 
fined. Complete descriptions of the language can be found in the IEEE 1076, IEEE 1U7J.3, 
IEEE 1 164 specifications. 

Acffnym for: Versa Module Eurocard bus. A popular microcomputer (board) bus. While this 
specification is very flexible, it isn't practical for SoC applications. 
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WISHBONE DATASHEET 

A type of documentation required for WISHBONE compatible IP cores. This helps the end user 
understand the detailed operation of the core, and how to connect it to other cores. The WISH- 
BONE DATASHEET can be included as part of an IP core technical reference manual, or as part 
of the IP core hardware description. 

WISHBONE Signal 

A signal that is defined as part of the WISHBONE specification. Non-WISHBONE signals can 
also be used on the IP core, but are not defined as part of this specification. For example, 
[ACK_0] is a WISHBONE signal, but [CLK 1 00_I] is not. 

WISHBONE Logo 

A logo that, when affixed to a document, indicates that the associated SoC component is com- 
patible with the WISHBONE standard. 

Wrapper 

A circuit element that converts a non-WlSHBONE IP Core into a WISHBONE compatible IP 
Core. For example, consider a 16-byte synchronous memory primitive that is provided by an IC 
vendor. The memory primitive can be made into a WISHBONE compatible SLAVE by layering 
a circuit over the memory primitive, thereby creating a WISHBONE compatible SLAVE. A 
wrapper is analogous to a technique used to convert software written in 'C to that written in 
'C++'. 



1.8 References 

IEEE 100: The Authoritative Dictionary of IEEE Standards Terms. Seventh Edition . IEEE Press 
2000. 
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Chapter 2 - Interface Specification 

This chapter describes the signaling method between MASTER, SLAVE, SYSCON and 1N- 
TERCON interfaces. This includes numerous options which may or may not be present on a 
particular interface. Furthermore, it describes a minimum level of required documentation that 
must be created for each IP core. 



2.1 Required Documentation for IP Cores 

WISHBONE compatible IP cores must include documentation that describes the interface. This 
helps the end user understand the operation of the core, and how to connect it to other cores. 
This documentation takes the form of a WISHBONE DATASHEET. It can be included as part 
of the IP core technical reference manual, it can be embedded in source code or it can take other 
forms as well. 



2.1.1 General Requirements for the WISHBONE DATASHEET 
RULE 2.00 

Each WISHBONE compatible IP core MUST include a WISHBONE DATASHEET as part of 
the IP core documentation. 



RULE 2.05 

The WISHBONE DATASHEET MUST include the revision level of the WISHBONE specifi- 
cation to which it was designed. 



RULE 2.10 

The WISHBONE DATASHEET MUST indicate whether it is a MASTER, SLAVE, SYSCON 
or INTERCON interface. Furthermore, it MUST indicate the types of bus cycles it supports. 



RULE 2.15 

The WISHBONE DATASHEET for MASTER, SLAVE and INTERCON interfaces MUST in- 
clude the following information: 

(1) If a MASTER supports the optional [ERR_I] signal, then the WISHBONE DA- 
TASHEET MUST describe how it reacts in response to the signal. If a SLAVE sup- 
ports the optional [ERR_0] signal, then the WISHBONE DATASHEET MUST de- 
scribe the conditions under which the signal is generated. 

(2) If a MASTER supports the optional [RTY I] signal, then the WISHBONE DA- 
TASHEET MUST describe how it reacts in response to the signal. If a SLAVE sup- 
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ports the optional [RTY O] signal, then the WISHBONE DATASHEET MUST de- 
scribe the conditions under which the signal is generated. 

(3) All interfaces that support the [TAGNJ] or [TAGN_0] signal(s) MUST describe 
their use in the WISHBONE DATASHEET. 

(4) The WISHBONE DATASHEET MUST indicate the port size. The port size MUST 
be indicated as: 8-bit, 1 6-bit, 32-bit or 64-bit. 

(5) The WISHBONE DATASHEET MUST indicate the port granularity. The granular- 
ity MUST be indicated as: 8-bit, 1 6-bit, 32-bit or 64-bit. 

(6) The WISHBONE DATASHEET MUST indicate the maximum operand size. The 
maximum operand size MUST be indicated as: 8-bit, 16-bit, 32-bit or 64-bit. If the 
maximum operand size is unknown, then the maximum operand size shall be the 
same as the granularity. 

(7) The WISHBONE DATASHEET MUST indicate the data transfer ordering. The or- 
dering MUST be indicated as BIG ENDIAN or LITTLE ENDIAN. When the port 
size equals the granularity, then the interface shall be specified as BIG ENDIAN 
and/or LITTLE ENDIAN. [When the port size equals the granularity, then BIG EN- 
DIAN and LITTLE ENDIAN transfers are identical]. 

(8) The WISHBONE DATASHEET MUST indicate the sequence of data transfer 
through the port. If the sequence of data transfer is not known, then the datasheet 
MUST indicate that it is undefined. 

(9) The WISHBONE DATASHEET MUST indicate if there are any constraints on the 
[CLK_I] signal. These constraints include (but are not limited to) clock frequency, 
application specific timing constraints, the use of gated clocks or the use of variable 
clock generators. 

2.1.2 Signal Naming 
RULE 2.20 

Signal names MUST adhere to the rules of the native tool in which the IP core is designed. 



PERMISSION 2.00 

Any signal name MAY be used to describe the WISHBONE signals. 



OBSERVATION 2.00 

Most hardware description languages (such as VHDL or Verilog®) have naming conventions. 
For example, the VHDL hardware description language defines the alphanumeric symbols which 
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m ay be used. Furthermore, it states that UPPERCASE and LOWERCASE characters may be 
used in a signal name. 



RECOMENDATION 2.00 

It is recommended that the interface use the signal names defined in this document. 



S I sfmplified if the signal names match those given in this specificate. How- 
ever n som ^ ca es Tuch as IP cores with multiple WISHBONE interconnects) they cannot be 
uled lZ7«lw signal names will not result in any serious integrate problems 
since all hardware description tools allow signals to be renamed. 



^WISHBONE DATASHEET MUST include the signal names that are defined for a WISH- 
BONES™ interface. If a signal name is different than defined in this specification then it 
MUST be LtreLnced to the corresponding signal name which is used in this specification. 

2.1.3 Logic Levels 

RULE 2.30 . .... . 

All WISHBONE interface signals MUST use active high logic. 

SS A S°Ji" 0 «tive low signals does not present a problem. However, RULE 2.30 is 
^SSSt^nc tools (especially schematic entry tools) do not have a — -y o 
indicating an active low signal. For example, a reset signal could be named [#RST J] [/RST J\ 
or ^ RST II This was found to cause confusion among users and incompatibility between 
mod^lef fhis constraint should not create any undue difficulties, as the system integrator can 
invert any signals before use by the WISHBONE interface. 

PERMISSION 2.05 

Non-WISHBONE signals MAY be used with IP core interfaces. 

SSSSS?2S mdude non-WISHBONE signals. These are outside ^^^^ 
ration and no attempt is made to govern them. For example, a disk controller IP core could 
hav a SSSSSS^ce on one end and a disk interface on the other. In this case the spec- 
fication does not dictate any technical requirements for the disk interface signals. 
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OBSERVATION 2.20 

[TAGNI] and [TAGN O] are user defined signals that must adhere to the timing specifications 
given in this document. 

2.2 WISHBONE Signal Description 

This section describes the signals used in the WISHBONE interconnect. Some of these signals 
are optional, and may or may not be present on a specific interface. 

2.2.1 SYSCON Signals 
CLK_0 

The system clock output [CLK_0] is generated by the SYSCON interface. It coordinates all ac- 
tivities for the internal logic within the WISHBONE interconnect. The INTERCON connects the 
[CLK_0] output to the [CLK_I] input on MASTER and SLAVE interfaces. 



RST_0 

The reset output [RST O] is generated by the SYSCON interface. It forces all WISHBONE in- 
terfaces to restart. All internal self-starting state machines are forced into an initial state. The 
INTERCON connects the [RST_0] output to the [RST_I] input on MASTER and SLAVE inter- 
faces. 



2.2.2 Signals Common to MASTER and SLAVE Interfaces 
CLK_I 

The clock input [CLK_I] coordinates all activities for the internal logic within the WISHBONE 
interconnect. All WISHBONE output signals are registered at the rising edge of [CLK_I]. All 
WISHBONE input signals must be stable before the rising edge of [CLK_I]. 



RST_I 

The reset input [RST I] forces the WISHBONE interface to restart. Furthermore, all internal 
self-starting state machines will be forced into an initial state. This signal only resets the 
WISHBONE interface. It is not required to reset other parts of an IP core (although it may be 
used that way). 



TAGNI 

The tag input(s) [TAGNJ] are user defined, and are used with either MASTER or SLAVE inter- 
faces. 'N' in this signal name refers to a tag number because multiple tags may be used (e.g. 
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[TAGS 0) Tag inputs are used whenever an IP eore needs specific formation from the inter- 
Snnecuon. For example, a MASTER can be designed to momtor the state of a FIFO. 

Thf to N g-ou,pu,(s, [TAON.O, are user d efine* a^e J^^^^ 
core provider in the WISHBONE DATASHEET. 



2.2.3 MASTER Signals 



Siowledge input [ACKJ], when asserted, indicates the termination of a normal bus cycle. 
Also seethe [ERRJ] and [RTY l] signal descriptions. 

ADR_O(63..0) bi address with t he most signifi- 

The address output array [ADR_O(63..0)j is , usea £P*> J Th j may boundary is 

FIFO interfaces) the array may not be present on the interlace. 

C , YC -°, * * rrvr Ol when asserted indicates that a valid bus cycle is in progress. The 
The cycle output [CYC_0], when assenea, inu . d ■ BLOCK transfer cycle 

signal is asserted for the duration of all! ^.^^^^^^ the first data trans- 
is held until [CYC_0] is negated. 

DAT I(63..0) data array boundaries are de- 

JSS [ D e AT_O P (63..0 ) ] and [S EL_O(7..0)l signal descriptions. 

DAT_O(63..0) . d to Tne ^ boundaries are de- 



30 

WISHBONE SoC Architecture Specification, Revision B.2 



ERRI 

The error input [ERR_I] indicates an abnormal cycle termination. The source of the error, and 
the response generated by the MASTER is defined by the IP core supplier. Also see the 
[ACK I] and [RTY_I] signal descriptions. 



RTYI 

The retry input [RTY_I] indicates that the interface is not ready to accept or send data, and that 
the cycle should be retried. When and how the cycle is retried is defined by the IP core supplier. 
Also see the [ERR_I] and [RTY_I] signal descriptions. 



SEL_O(7..0) 

The select output array [SEL_O(7..0)] indicates where valid data is expected on the 
[DAT_I(63..0)] signal array during READ cycles, and where it is placed on the [DAT_O(63..0)] 
signal array during WRITE cycles. Also see the [DAT_I(63..0)], [DAT_O(63..0)] and [STBJ3] 
signal descriptions. 



STB_0 

The strobe output [STB O] indicates a valid data transfer cycle. It is used to qualify various 
other signals on the interface such as [SEL_O(7..0)]. The SLAVE must assert either the 
[ACK_I], [ERR_I] or [RTY_I] signals in response to every assertion of the [STB_0] signal. 



WEJ) 

The write enable output [WE_0] indicates whether the current local bus cycle is a READ or 
WRITE cycle. The signal is negated during READ cycles, and is asserted during WRITE cycles. 



2.2.4 SLAVE Signals 
ACK_0 

The acknowledge output [ACK_0], when asserted, indicates the termination of a normal bus cy- 
cle. Also see the [ERR_0] and [RTY_0] signal descriptions. 



ADR_I(63..0) 

The address input array [ADR_I(63..0)] is used to pass a binary address, with the most signifi- 
cant address bit at the higher numbered end of the signal array. The lower array boundary is 
specific to the data port size. The higher array boundary is core-specific. In some cases (such as 
FIFO interfaces) the array may not be present on the interface. 



WISHBONE SoC Architecture Specification, Revision B.2 



31 



CYC I 

The cycle input [CYC I], when asserted, indicates that a valid bus cycle is un j progress. The sig- 
nal is asserted for the duration of all bus cycles. For example, during a BLOCK transfer cycle 
there can be multiple data transfers. The [CYCJ] signal is asserted during the first data transfer, 
and remains asserted until the last data transfer. 

T^dkta inDUt array [DAT I(63..0)] is used to pass binary data. The array boundaries are de- 
*™fned byfoeTort size. Also see the [DAT_O(63..0)] and [SEL_O(7..0)] signal descriptions. 

?h A e T d^ ( oui°ut array [DAT O(63..0)] is used to pass binary data. The array boundaries are de- 
termined by the port size. Also see the [DAT_I(63..0)] and [SEL_O(7..0)] signal descriptions. 

ERR O a 

The error output [ERR O] indicates an abnormal cycle termination. The source of the error, and 
the response generated by the MASTER is defined by the IP core supplier. Also see the 
[ACK_0] and [RTY_0] signal descriptions. 

RTY O 

The retry output [RTY O] indicates that the indicates that the interface is not ready to accept or 
send data, and that the cycle should be retried. When and how the cycle ,s retried is defined by 
the IP core supplier. Also see the [ERR_0] and [RTY O] signal descriptions. 

T^eSfmput array [SEL_1(7..0)] indicates where valid data is placed on the [DAT 1(63 0)] 
sienal arrav during WRITE cycles, and where it should be present on the [DAT_O(63..0)] s gna 
fSg TaI cycles. Also see the [DAT_I(63..0)], [DAT_O(63..0)] and [STBJ] signal 
descriptions. 

Tteitrobe input [STB I], when asserted, indicates that the SLAVE is selected. A SLAVE shall 
respond to other WISHBONE signals only when this [STBJ] is asserted, except for the ^[RST J) 
signal which should always be responded to. The SLAVE must assert either the [ACK_0], 
[ERR_0] or [RTY_0] signals in response to every assertion of the [STBJ J signal. 

^he-write enable input [WE_I] indicates whether the current local J™/*?*^*?™™ 
WRITE cycle. The signal is negated during READ cycles, and is asserted during WRITE cycles. 
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Chapter 3 - Bus Cycles 

WISHBONE bus cycles are described in terms of their general operation, reset operation, hand- 
shaking protocol and the data organization during transfers. Additional requirements for bus cy- 
cles (especially those relating to the common clock) can be found in the timing specifications in 
Chapter 4. 



3.1 General Operation 

Each MASTER and SLAVE are interconnected with a set of signals that permit them to ex- 
change data. For descriptive purposes these signals are cumulatively known as a bus, and are 
contained within a functional module called the INTERCON. Address, data and other informa- 
tion is impressed upon this bus in the form of bus cycles. 



3.1.1 Reset Operation 

All hardware interfaces must be initialized to a pre-defined state. This is accomplished with the 
reset signal [RST_0], which can be asserted at any time. It is also used for test simulation pur- 
poses by initializing all self-starting state machines and counters which may be used in the de- 
sign. The reset signal [RST_0] is driven by the SYSCON interface. It is connected to the 
[RST_I] signal on all MASTER and SLAVE interface. Figure 3-1 shows the reset cycle. 



CLK I 



RST I 



N0TE(2) 

/© N0TE(1) 



J 



™0 W WWW ^K 



NOTE(3) 



A 



NOTES: 

(1) Reset cycles can be extended for any length of time. 

(2) Self -starting state machines & counters reset themselves at 
the rising [OKI] edge following the assertion of [RST_I] . 
On MASTERS, [STB 0] and [CYC 0] are negated at the same time. 

(3) On MASTERS, [STE 0] and [CXC 0] may be asserted at the 
rising [CLK_I] eHge following the negation of [RST_I] . 



Figure 3-1. Reset cycle. 
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All WISHBONE interfaces MUST initialize themselves at the rising [CLK_0] edge following 
the assertion of [RST_0]. They MUST stay in the initialized state until the rising [CLK_0] edge 
that follows the negation of [RST_0]. 



[RSTJ] 3 *MUST be asserted for at least one complete clock cycle on all WISHBONE interfaces. 



PERMISSION 3.00 * • A e •* i 

[RST_0] MAY be asserted for more than one clock cycle, and MAY be asserted indefinitely. 



RULE 3.10 moT _ 

All WISHBONE interfaces MUST be capable of reacting to [RSTJ] at any time. 



RULE 3 15 

All self-starting state machines and counters in WISHBONE interfaces MUST initialize them- 
selves at the rising [CLK_I] edge following the assertion of [RSTJ]. They MUST stay in the 
initialized state until the rising [CLK_I] edge that follows the negation of [RSTJ]. 

OBSERVATION 3.00 tt 

In general, self-starting state machines do not need to be initialized. However, this may cause 
problems because some simulators may not be sophisticated enough to find an initial starting 
point for the state machine. Furthermore, self-starting state machines can go through an inde- 
terminate number of initialization cycles before finding their starting state, thereby making it dif- 
ficult to predict their behavior at start-up time. The initialization rule prevents both problems by 
forcing all state machines to a pre-defined state in response to the assertion of [RSTJ]. 



RULE 3 20 

The following MASTER signals MUST be negated at the rising [CLKJ] edge following the as- 
sertion of [RST I], and MUST stay in the negated state until the rising [CLKJ] edge that fol- 
lows the negatio"n of [RSTJ]: [STB_0], [CYC_0]. The state of all other MASTER signals are 
undefined in response to a reset cycle. 

OBSERVATION 3.05 . . 

On MASTER interfaces [STB_0] and [CYC_0] may be asserted beginning at the rising 

[CLKJ] edge following the negation of [RSTJ]. 
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OBSERVATION 3.10 

SLAVE interfaces automatically negate [ACK_0], [ERR_0] and [RTY_0] when their [STB J] 
is negated. 

RECOMENDATION 3.00 

Design SYSCON circuits so that they assert [RST O] during a power-up condition. [RST_0] 
should remain asserted until all voltage levels and clock frequencies in the system are stabilized. 
When negating [RST_0], do so in a synchronous manner that conforms to this specification. 

OBSERVATION 3.15 

If a gated clock generator is used, and if the clock is stopped, then the WISHBONE interface is 
not capable of responding to its [RST_I] signal. 

SUGGESTION 3.00 

Some circuits require an asynchronous reset capability. If an IP core or other SoC component 
requires an asynchronous reset, then define it as a non-WISHBONE signal. This prevents confu- 
sion with the WISHBONE reset [RST_0] signal, which uses a purely synchronous protocol, and 
need be applied only to the WISHBONE interface. 



OBSERVATION 3.20 

All WISHBONE interfaces must respond to the reset signal. However, the IP Core connected to 
a WISHBONE interface does not necessarily need to respond to the reset signal. 

3.1.2 Handshaking Protocol 

All bus cycles use a handshaking protocol between the MASTER and SLAVE interfaces. As 
shown in Figure 3-2, the MASTER asserts [STB_0] when it is ready to transfer data. [STB_0] 
remains asserted until the SLAVE asserts one of the cycle terminating signals [ACK_I], [ERR I] 
or [RTY I]. At every rising edge of [CLK_I] the terminating signal is sampled. If it is asserted, 
then [STB_0] is negated. This gives both the MASTER and SLAVE interfaces the possibility to 
control the rate at which data is transferred. 

If the SLAVE guarantees it can keep pace with all MASTER interfaces, and if the [ERR_I] and 
[RTYJ] signals are not used, then the [ACKJ] signal may be tied to the SLAVE'S [STB J] in- 
put. The interface will function normally under these circumstances. 
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Figure 3-2. Local bus handshaking protocol. 



Most of the examples in this specification describe the use of [ACKJ] to terminate a local bus 
cycle. However, the SLAVE can optionally terminate the cycle with an error [ERR_0], or re- 
quest that the cycle be retried [RTY_0]. 

All interfaces include the [ACKJ] terminator signal. Asserting this signal during a bus cycle 
causes it to terminate normally. 

Asserting the [ERR_I] signal during a bus cycle will terminate the cycle. It also serves to notify 
the MASTER that an error occurred during the cycle. This signal is generally used if an error 
was detected by SLAVE logic circuitry. For example, if the SLAVE is a parity-protected mem- 
ory, then the [ERR_I] signal can be asserted if a parity fault is detected. This specification does 
not'dictate what the MASTER will do in response to [ERR_I]. 

Asserting the optional [RTY_I] signal during a bus cycle will terminate the cycle. It also serves 
to notify the MASTER that the current cycle should be aborted, and retried at a later time. This 
signal is generally used for shared memory and bus bridges. In these cases SLAVE circuitry 
would assert [RTY_I] if the local resource is busy. This specification does not dictate when or 
how the MASTER will respond to [RTY I]. 



As a minimum, the MASTER interface MUST include the following signals: [ACKJ], [CLK J], 
[CYC J)], [RSTJ] and [STB_0]. As a minimum, the SLAVE interface MUST include the fol- 
lowing signals: [ACK_0], [CLK J] and [RSTJ]. All other signals are optional. 



PERMISSION 3.05 . 

MASTER and SLAVE interfaces MAY be designed to support the [ERR J] and [ERRJJJ sig- 
nals In these cases, the SLAVE asserts [ERR O] to indicate that an error has occurred during 
the bus cycle. This specification does not dictate what the MASTER does in response to 
[ERR J]. 
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PERMISSION 3.10 

MASTER and SLAVE interfaces MAY be designed to support the [RTYJ] and [RTYO] sig- 
nals. In these cases, the SLAVE asserts [RTY O] to indicate that the interface is busy, and that 
the bus cycle should be retried at a later time. This specification does not dictate what the 
MASTER will do in response to [RTYJ]. 



RULE 3.30 

If a SLAVE supports the [ERRO] or [RTY_0] signals, then the SLAVE MUST NOT assert 
more than one of the following signals at any time: [ACK_0], [ERR O] or [RTY_0]. 



RECOMMENDATION 3.00 

Design WISHBONE MASTER interfaces so that there are no intermediate logic gates between a 
registered flip-flop and the signal outputs on [STB_0] and [CYC_0]. Delay timing for 
[STB_0] and [CYC_0] are very often the most critical paths in the system. This prevents 
sloppy design practices from slowing down the interconnect because of added delays on these 
two signals. 



RULE 3.35 

SLAVE interfaces MUST be designed so that the [ACK_0], [ERR O] and [RTY O] signals are 
asserted and negated in response to the assertion and negation of [STB_I]. Furthermore, this ac- 
tivity MUST occur asynchronous to the [CLKI] signal (i.e. there is a combinatorial logic path 
between [STBJ] and [ACK_0], etc.). 



OBSERVATION 3.25 

The asynchronous logic requirement assures that the interface can accomplish one data transfer 
per clock cycle. Furthermore, it simplifies the design of arbiters in multi-MASTER applications. 

RECOMMENDATION 3.05 

Design interconnection logic to prevent deadlock conditions when MASTER accesses are made 
to unused address locations. One solution to this problem is to include a watchdog timer func- 
tion that monitors the MASTER'S [STB_0] signal, and asserts [ERR_I] or [RTYJ] if the cycle 
exceeds some pre-defined time limit. 



PERMISSION 3.15 

Under certain circumstances SLAVE interfaces MAY be designed to hold [ACK O] in the as- 
serted state. This situation occurs on point-to-point interfaces where there is a single SLAVE on 
the interface, and that SLAVE always operates without wait states. 
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MASTER interfaces MUST be designed to operate normally when SLAVE interface holds 
[ACKI] in the asserted state. 



3.1.3 Use of [STB_0] 

MASTCR fnterfaces MUST qualify the following signals with [STB_0]: [ADR J)], [DAT_0()], 
[SELOO], [WE_0], [SEL_0] and [TAGNO]. 

MASTER interfaces MUST assert [CYC_0] for the duration of SINGLE READ / WRITE, 
BLOCK and RMW cycles. [CYC_0] MUST be asserted no later than the rising [CLK_I] edge 
that qualifies the assertion of [STB_0]. [CYC_0] MUST be negated no earlier than the rising 
[CLK I] edge that qualifies the negation of [STB O]. 

3.1.4 Use of [ACK_0], [ERR_0] and [RTY_0] 
RULE 3 55 

SLAVE interfaces MUST qualify the following signals with [ACK_0], [ERR_0] or [RTY O]: 
[DAT_0()]. 

3.1.5 Use of [TAGNJ] and [TAGN_0] Signals 

The TAG signals [TAGNJ] and [TAGN_0] are used by both the MASTER and SLAVE inter- 
faces. They are used for three purposes: (a) to tag data with information such as parity or time 
stamps, (b) to identify specialty bus cycles (like interrupts or cache control operations) and (c) to 
communicate with the bus interconnection. These signals are user defined. 

For example, the designer of a MASTER may wish to add parity check bits to its bus cycle. In 
this case a [TAGN_0] signal is defined by the IP core designer, and logic would be created to 
generate the bit. Furthermore, the signal would be described in the WISHBONE DATASHEET. 

In another example, the designer of a SLAVE interface may wish to notify the bus interconnec- 
tion logic with the size of it's data interface. In this case a [TAGN_0] signal is defined by the IP 
core designer, and logic would be created to reflect the bus size. The signal would also be de- 
scribed in the WISHBONE DATASHEET. 



RULE 3.60 .„•••«!.• 
The [TAGNJ] and [TAGN_0] signals MUST adhere to the timing specifications given in this 

document. 
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3.2 SINGLE READ / WRITE Cycles 

The SINGLE READ / WRITE cycles perform one data transfer at a time. These are the basic 
cycles used to perform data transfers on the WISHBONE interconnect. 



RULE 3.65 

All MASTER and SLAVE interfaces that support SINGLE READ or SINGLE WRITE cycles 
MUST conform to the timing requirements given in sections 3.2.1 and 3.2.2. 



PERMISSION 3.20 

MASTER and SLAVE interfaces MAY be designed so that they do not support the SINGLE 
READ or SINGLE WRITE cycles. 
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3.2.1 SINGLE READ Cycle 

Figure 3-3 shows a SINGLE READ cycle. The bus protocol works as follows: 

CLOCK EDGE 0: MASTER presents [ADR_0()] and [TAGN O], 

MASTER negates [WE_0] to indicate a READ cycle. 

MASTER presents bank select [SEL_0()] to indicate where it expects data. 

MASTER asserts [CYC_0] to indicate the start of the cycle. 

MASTER asserts [STB_0] to qualify [ADR_0()], [SEL_0()] and [WE_0]. 

SETUP, EDGE 1 : SLAVE decodes inputs, and responds by asserting [ACK_I]. 
SLAVE presents valid data on [DAT_I()]. 

SLAVE asserts [ACKJ] in response to [STB_0] to indicate valid data. 
SLAVE presents [TAGN_0]. 
MASTER monitors [TAGNJ]. 

MASTER monitors [ACKJ], and prepares to latch data on [DAT_1()]. 

Note: SLAVE may insert wait states (-WSS-) before asserting [ACKJ], 
thereby allowing it to throttle the cycle speed. Any number of wait states 
may be added. 

CLOCK EDGE 1 : MASTER latches data on [DATJ()]. 

MASTER latches [TAGNJ]. 

MASTER negates [STB_0] and [CYC_0] to indicate the end of the cycle. 
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Figure 3-3. SINGLE READ cycle. 
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3.2.2 SINGLE WRITE Cycle 

Figure 3-4 shows a SINGLE WRITE cycle. The bus protocol works as follows: 

CLOCK EDGE 0: MASTER presents [ADR_0()] and [TAGN O]. 

MASTER asserts [WE_0] to indicate a WRITE cycle. 

MASTER presents bank select [SEL_0()] to indicate where it sends data. 

MASTER asserts [CYC_0] to indicate the start of the cycle. 

MASTER asserts [STB_0] to qualify [ADR_0()], [SEL_0()] and [WE_0]. 

SETUP, EDGE 1: SLAVE decides inputs, and responds by asserting [ACK_I]. 

SLAVE presents prepares to latch data on [DAT_0()]. 

SLAVE asserts [ACK_I] in response to [STB_0] to indicate latched data. 

SLAVE presents [TAGN_0]. 

MASTER monitors [TAGNJ]. 

MASTER monitors [ACKJ], and prepares to terminate the cycle. 

Note: SLAVE may insert wait states (-WSS-) before asserting [ACKJ], 
thereby allowing it to throttle the cycle speed. Any number of wait states 
may be added. 

CLOCK EDGE 1: SLAVE latches data on [DAT_0()]. 

MASTER latches [TAGNJ]. 

MASTER negates [STB_0] and [CYC JD] to indicate the end of the cycle. 
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Figure 3-4. SINGLE WRITE cycle. 
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3.3 BLOCK READ / WRITE Cycles 

The BLOCK transfer cycles perform multiple data transfers. They are very similar to single 
READ and WRITE cycles, but have a few special modifications to support multiple transters. 

During BLOCK cycles, the interface basically performs SINGLE READ/WRITE cycles as de- 
scribed above. However, the BLOCK cycles are modified somewhat so that these individual cy- 
cles are combined together to form a single BLOCK cycle, ^is function s most useful when 
multiple MASTERS are used on the interconnect. For example if the SLAVE is a shared I (dua 
port) memory, then an arbiter for that memory can determine when one MASTER is done with it 
so that another can gain access to the memory. 

As shown in Figure 3-5, the [CYC_0] signal is asserted for the duration of a BLOCK cycle. 
This signal can be used to request permission to access from a shared resource from a local arbi- 
ter, and hold the access until the end of the current cycle During each °f the data i transfers 
(within the block transfer), the normal handshaking protocol between [STB_0] and [ACK1J is 
maintained. 
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Figure 3-5. Use of TCYC 01 signal during BLOCK cycl es. 



It should be noted that the [CYC_0] signal does not necessarily rise and fall at the same time as 
[STB O] [CYC O] may be asserted at the same time as [STB_0], or one or more [CLK_lj 
edges'before [STBO]. Similarly, [CYC_0] may be negated at the same time as [STBO], or 
after an indeterminate number of [CLK I] cycles. 



RULE 3 70 

All MASTER and SLAVE interfaces that support BLOCK cycles MUST conform to the timing 
requirements given in sections 3.3.1 and 3.3.2. 

PERMISSION 3.25 *u qt nrv ™ 

MASTER and SLAVE interfaces MAY be designed so that they do not support the BLOCK cy- 
cles. 
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3.3.1 BLOCK READ Cycle 



Figure 3-6 shows a BLOCK READ cycle. The BLOCK cycle is capable of a data transfer on 
every clock cycle. However, this example also shows how the MASTER and the SLAVE inter- 
faces can both throttle the bus transfer rate by inserting wait states. A total of five transfers are 
shown. After the second transfer the MASTER inserts a wait state. After the fourth transfer the 
SLAVE inserts a wait state. The cycle is terminated after the fifth transfer. The protocol for this 
transfer works as follows: 

CLOCK EDGE 0: MASTER presents [ADR_0()] and [TAGNO]. 

MASTER negates [WE O] to indicate a READ cycle. 

MASTER presents bank select [SEL_0()] to indicate where it expects data. 

MASTER asserts [CYC_0] to indicate the start of the cycle. 

MASTER asserts [STB_0]. 

Note: the MASTER must assert [CYC_0] and/or [TAGN O] at, or anytime 
before, clock edge 1. The use of [TAGN O] is optional. 

SETUP, EDGE 1: SLAVE decodes inputs, and responds by asserting [ACKI]. 
SLAVE presents valid data on [DATI]. 
SLAVE presents [TAGN_0]. 
MASTER monitors [TAGNI]. 

MASTER monitors [ACK I], and prepares to latch data on [DAT_I()]. 

CLOCK EDGE 1: MASTER latches data on [DAT_I()]. 

MASTER latches [TAGN I]. 

MASTER presents new [ADR_0()] and [TAGN_0]. 

MASTER presents new bank select [SEL_0()] to indicate where data is. 

SETUP, EDGE 2: SLAVE decodes inputs, and responds by asserting [ACK I]. 
SLAVE presents valid data on [DAT I]. 
SLAVE presents [TAGN O]. 
MASTER monitors [TAGNJ]. 

MASTER monitors [ACK I], and prepares to latch data on [DAT_I()]. 

CLOCK EDGE 2: MASTER latches data on [DAT_I()]. 

MASTER latches [TAGN I]. 

MASTER negates [STBO] to introduce a wait state (-WSM-). 

SETUP, EDGE 3: SLAVE negates [ACKJ] in response to [STB_0]. 

Note: any number of wait states can be inserted by the MASTER. 

CLOCK EDGE 3: MASTER presents new [ADR_0()] and [TAGN O]. 

MASTER presents new bank select [SEL_0()]. 
MASTER asserts [STB_0]. 
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SETUP, EDGE 4: SLAVE decodes inputs, and responds by asserting [ACK_I]. 
SLAVE presents valid data on [DAT_I]. 
SLAVE presents [TAGN O]. 
MASTER monitors [TAGN_I]. 

MASTER monitors [ACK_I], and prepares to latch data on [DAT_I()]. 

CLOCK EDGE 4: MASTER latches data on [DAT_I()]. 

MASTER presents [ADR_0()] and [TAGN_0]. 
MASTER latches [TAGNI]. 

MASTER presents new bank select [SEL_0()] to indicate where it expects 
data. 

SETUP, EDGE 5: SLAVE decodes inputs, and responds by asserting [ACKI]. 
SLAVE presents valid data on [D AT_I] . 
SLAVE presents [TAGN_0]. 
MASTER monitors [TAGNJ]. 

MASTER monitors [ACK_I], and prepares to latch data on [DAT_I()]. 

CLOCK EDGE 5: MASTER latches data on [DAT_1()]. 

MASTER latches [TAGN I]. 

SLAVE negates [ACK_I] to introduce a wait state. 

Note: any number of wait states can be inserted by the SLAVE at this point. 

SETUP, EDGE 6: SLAVE decodes inputs, and responds by asserting [ACK I]. 
SLAVE presents valid data on [DAT I] . 

MASTER monitors [ACKJ], and prepares to latch data on [DAT_I()]. 

CLOCK EDGE 6: MASTER latches data on [DAT_I()]. 

MASTER terminates cycle by negating [STB_0] and [CYC_0]. 
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Figure 3-6. BLOCK READ cycle. 
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3.3.2 BLOCK WRITE Cycle 

Figure 3-7 shows a BLOCK WRITE cycle. The BLOCK cycle is capable of a data transfer on 
every clock cycle. However, this example also shows how the MASTER and the SLAVE inter- 
faces can both throttle the bus transfer rate by inserting wait states. A total of five transfers are 
shown. After the second transfer the MASTER inserts a wait state. After the fourth transfer the 
SLAVE inserts a wait state. The cycle is terminated after the fifth transfer. The protocol for this 
transfer works as follows: 

CLOCK EDGE 0: MASTER presents [ADR_0()] and [TAGN_0]. 

MASTER asserts [WE_0] to indicate a WRITE cycle. 
MASTER presents bank select [SEL_0()] to indicate where it expects data. 
MASTER asserts [CYC_0] and [TAGN_0] to indicate cycle start. 
MASTER asserts [STB_0]. 

Note: the MASTER must assert [CYC_0] and/or [TAGNO] at, or anytime 
before, clock edge 1. The use of [TAGN_0] is optional. 

SETUP, EDGE 1: SLAVE decodes inputs, and responds by asserting [ACK_I]. 
SLAVE prepares to latch data on [DATO] . 
SLAVE presents [TAGN_0]. 
MASTER monitors [TAGNI]. 

MASTER monitors [ACK_I], and prepares to terminate current data phase. 

CLOCK EDGE 1: SLAVE latches data on [DAT_0()]. 

MASTER latches [TAGNJ]. 

MASTER presents [ADR_0()] and [TAGN O]. 

MASTER presents new bank select [SEL_0()]. 

SETUP, EDGE 2: SLAVE decodes inputs, and responds by asserting [ACK_I]. 
SLAVE prepares to latch data on [DAT O]. 
SLAVE presents [TAGN_0]. 
MASTER monitors [TAGN I]. 

MASTER monitors [ACK_I], and prepares to terminate current data phase. 

CLOCK EDGE 2: SLAVE latches data on [DAT_0()]. 

MASTER latches [TAGNJ]. 

MASTER negates [STB_0] to introduce a wait state (-WSM-). 
SETUP, EDGE 3: SLAVE negates [ACKJ] in response to [STB_0]. 

Note: any number of wait states can be inserted by the MASTER at this 
point. 

CLOCK EDGE 3: MASTER presents [ADR_0()] and [TAGN O]. 

MASTER presents bank select [SEL_0()] to indicate where it expects data. 
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MASTER asserts [STB_0]. 

SETUP, EDGE 4: SLAVE decodes inputs, and responds by asserting [ACK_I]. 
SLAVE prepares to latch data on [DAT_0] . 
SLAVE presents [TAGNO]. 
MASTER monitors [TAGNJ]. 

MASTER monitors [ACKJ], and prepares to terminate data phase. 

CLOCK EDGE 4: SLAVE latches data on [DAT_0()]. 

MASTER latches [TAGNJ]. 

MASTER presents [ADR_0()] and [TAGN O]. 

MASTER presents new bank select [SEL OO] to indicate where it expects 
data. 

SETUP, EDGE 5: SLAVE decodes inputs, and responds by asserting [ACK_I]. 
SLAVE prepares to latch data on [DATO]. 
SLAVE presents [TAGN O]. 
MASTER monitors [TAGNJ]. 

MASTER monitors [ACKJ], and prepares to terminate data phase. 

CLOCK EDGE 5: SLAVE latches data on [DAT_0()]. 

SLAVE negates [ACK J] to introduce a wait state. 
MASTER latches [TAGNJ]. 

Note: any number of wait states can be inserted by the SLAVE at this point. 

SETUP, EDGE 6: SLAVE decodes inputs, and responds by asserting [ACK J]. 
SLAVE prepares to latch data on [DAT O]. 
MASTER monitors [ACK J], and prepares to terminate data phase. 

CLOCK EDGE 6: SLAVE latches data on [DAT_0()]. 

MASTER terminates cycle by negating [STB JD] and [CYC_0]. 
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Figure 3-7. BT.OCK WRITE cycle. 
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3.4 RMW Cycle 



The RMW (read-modify-write) cycle is used for indivisible semaphore operations. During the 
first half of the cycle a single read data transfer is performed. During the second half of the cycle 
a write data transfer is performed. The [CYC O] signal remains asserted during both halves of 
the cycle. 

RULE 3.75 

All MASTER and SLAVE interfaces that support RMW cycles MUST conform to the timing 
requirements given in section 3.4. 

PERMISSION 3.30 

MASTER and SLAVE interfaces MAY be designed so that they do not support the RMW cy- 
cles. 

Figure 3-8 shows a read-modify-write (RMW) cycle. The RMW cycle is capable of a data trans- 
fer on every clock cycle. However, this example also shows how the MASTER and the SLAVE 
interfaces can both throttle the bus transfer rate by inserting wait states. Two transfers are 
shown. After the first (read) transfer, the MASTER inserts a wait state. During the second trans- 
fer the SLAVE inserts a wait state. The protocol for this transfer works as follows: 

CLOCK EDGE 0: MASTER presents [ADR_0()] and [TAGNO]. 

MASTER negates [WE_0] to indicate a READ cycle. 
MASTER presents bank select [SEL_0()] to indicate where it expects data. 
MASTER asserts [CYC_0] and [TAGN_0] to indicate the start of cycle. 
MASTER asserts [STB_0]. 

Note: the MASTER must assert [CYC_0] and/or [TAGN O] at, or anytime 
before, clock edge 1 . The use of [TAGN O] is optional. 

SETUP, EDGE 1: SLAVE decodes inputs, and responds by asserting [ACK I]. 
SLAVE presents valid data on [DAT I]. 
SLAVE presents [TAGN_0]. 
MASTER monitors [TAGNJ]. 

MASTER monitors [ACK_I], and prepares to latch data on [DAT_I()]. 

CLOCK EDGE 1: MASTER latches data on [DATJO]. 

MASTER latches [TAGNJ]. 

MASTER negates [STB_0] to introduce a wait state (-WSM-). 

SETUP, EDGE 2: SLAVE negates [ACK_I] in response to [STB_0]. 

MASTER asserts [WE_0] to indicate a WRITE cycle. 

Note: any number of wait states can be inserted by the MASTER at this 
point. 
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CLOCK EDGE 2: MASTER presents the same [ADR_0()] and [TAGNO] as was on clock 1 . 
MASTER presents WRITE data on [DAT_0()]. 
MASTER presents new bank select [SEL_0()]. 
MASTER asserts [STB_0]. 

SETUP, EDGE 3: SLAVE decodes inputs, and responds by asserting [ACK_I] (when ready). 
SLAVE presents valid data on [DAT I]. 
SLAVE presents [TAGN_0]. 
MASTER monitors [TAGNI]. 

MASTER monitors [ACK_I], and prepares to latch data on [DAT_I()]. 
Note: any number of wait states can be inserted by the SLAVE at this point. 

CLOCK EDGE 3: SLAVE latches data on [DAT_0()]. 

MASTER latches [TAGN I]. 

MASTER negates [STB_0] and [CYC_0] indicating the end of the cycle. 
SLAVE negates [ACK_I] in response to negated [STB_0]. 
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Figure 3-8. RMW cycle. 
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3.5 Data Organization 

Data organization refers to the ordering of data during transfers. There are two general types of 
ordering which are called BIG ENDIAN and LITTLE END1AN. BIG ENDIAN refers to data 
ordering where the most significant portion of an operand is stored at the lower address. LIT- 
TLE ENDIAN refers to data ordering where the most significant portion of an operand is stored 
at the higher address. The WISHBONE architecture supports both methods of data ordering. 



3.5.1 Nomenclature 

A BYTEfN) WORD(N), DWORD(N) and QWORD(N) nomenclature is used to define data or- 
dering. These terms are defined in Table 3-1. Figure 3-9 shows the operand locations for input 
and output data ports. 



Table 3-1. Data Transfer Nomenclature 


Nomenclature 


Granularity 


Description 


BYTE(N) 


8-bit 


An 8-bit BYTE transfer at address 'N'. 


WORD(N) 


16-bit 


A 16-bit WORD transfer at address 'N'. 


DWORD (N) 


32-bit 


A 32-bit Double WORD transfer at address 'N'. 


0WORD(N) 


64-bit 


A 64-bit Quadruple WORD transfer at address 'N'. 



The table also defines the granularity of the interface. This indicates the minimum unit of data 
transfer that is supported by the interface. For example, the smallest operand that can be passed 
through a port with 16-bit granularity is a 16-bit WORD. In this case, an 8-bit operand cannot be 
transferred. 

Figure 3-10 shows an example of how the 64-bit value of 0x0123456789ABCDEF is transferred 
through BYTE WORD, DWORD and QWORD ports using BIG ENDIAN data organization. 
Throush the 64-bit QWORD port the number is directly transferred with the most significant bit 
at DAT I /DAT 0(63). The least significant bit is at DAT_I / DAT_O(0). However, when the 
same operand is Transferred through a 32-bit DWORD port, it is split into two bus cycles. The 
two bus cycles are each 32-bits in length, with the most significant DWORD transferred at the 
lower address, and the least significant DWORD transferred at the upper address. A similar 
situation applies to the WORD and BYTE cases. 

Figure 3-11 shows an example of how the 64-bit value of 0x0123456789ABC is transferred 
through BYTE, WORD, DWORD and QWORD ports using LITTLE ENDIAN data organiza- 
tion Through the 64-bit QWORD port the number is directly transferred with the most signifi- 
cant bit at DAT I / DAT 0(63). The least significant bit is at DATJ / DAT_O(0). However, 
when the same operand is" transferred through a 32-bit DWORD port, it i^plit »nto two 1 ^us cy- 
cles The two bus cycles are each 32-bits in length, with the least significant DWORD trans- 
ferred at the lower address, and the most significant DWORD transferred at the upper address. A 
similar situation applies to the WORD and BYTE cases. 
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OFFSET 
ADDRESS 



63 



DAT I / DAT_0 



00 



QWORD(O) 


DWORD (0) 


DWORD (1) 


WORD(0) 


WORD(l) 


WORD (2) 


WORD (3) 


BYTE ( 0 ) BYTE ( 1 ) 


BYTE (2) BYTE (3) 


BYTE (4) | BYTE (5) 


BYTE (6) | BYTE (7) 



(a) BIG ENDIAN BYTE, WORD, DWORD and QWORD positioning in a 64-bit operand. 
63 DAT I / DAT_0 00 



QWORD (0) 


DWORD (1) 


DWORD (0) 


WORD (3) 


WORD (2) 


WORD(l) 


WORD(0) 


BYTE (7) BYTE (6) 


BYTE (5) BYTE (4) 


BYTE (3) BYTE (2) 


BYTE ( 1 ) BYTE { 0 ) 



(b) LITTLE ENDIAN BYTE, WORD, DWORD and QWORD positioning in a 64 -bit operand. 



63 



DAT I / DAT O 



00 



QWORD (0) 



DAT I / DAT_0 
07 00 






BYTE (7) 








BYTE {6) 








BYTE (5) 








BYTE (4) 




15 DAT I / DAT O 00 




BYTE (3) 




WORD (3) 




BYTE (2) 




WORD (2) 




BYTE ( 1 ) 




WORD(l) 




BYTE ( 0 ) 




WORD(0) 


BYTE 
ORDERING 




WORD 
ORDERING 



QWORD 
ORDERING 



31 



DAT I / DAT_0 



DWORD (1) 



DWORD (0) 



DWORD 
ORDERING 



(c) Address nomenclature. 



Figure 3-9. Operand locations for input and output data ports. 
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7 

6 

5 

4 

3 
2 
1 
0 



OFFSET 
ADDRESS 



63 



DAT I / DAT_0 



0X0123456789ABCDEF 



DAT_I / DAT_0 
07 00 



QWORD 
ORDERING 





OxEF 






OxCD 




OxAB 




0x89 




0x67 




0x45 




0x23 




0x01 





15 DAT I / DATOOa 



OxCDEF 



0x89AB 



0x4567 



0x0123 



•31 



BYTE 
ORDERING 



WORD 
ORDERING 



DAT I / DAT J) 



0X89ABCDEF 



0x01234567 



DWORD 
ORDERING 



00 



00 



Figure H£ ExamEle showin g a variety of RIO FNDIAN transfers over various port sizes. 
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OFFSET 
ADDRESS 



63 



DAT I / DAT_0 



0x0123456789ABCDEF 



QWORD 
ORDERING 



DAT_I / DAT_0 
07 00 





0x01 






0x23 




0x45 




0x67 




0x89 




OxAB 




OxCD 




OxEF 





15 DAT I / DAT J) 00 



0x0123 



0X4567 



0x8 9AB 



OxCDEF 



31 



BYTE 
ORDERING 



WORD 
ORDERING 



DAT I / DAT_0 



0x01234567 



0X89ABCDEF 



DWORD 
ORDERING 



00 



00 



v, a^ 3J 1 ExamEie showing » variety ofj ™ £ INDIAN tnmfers over various port sizes. 
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RULE 3.80 

Data organization MUST conform to the ordering indicated in Figure 3-9. 



3.5.2 Transfer Sequencing 

The sequence in which data is transferred through a port is not regulated by this specification. 
For example, a 64-bit operand through a 32-bit port will take two bus cycles. However, the 
specification does not require that the lower or upper DWORD be transferred first. 

RECOMMENDATION 3.10 

Design interfaces so that data is transferred sequentially from lower addresses to a higher ad- 
dresses. 

OBSERVATION 3.30 

The sequence in which an operand is transferred through a data port is not highly regulated by 
the specification. That is because different IP cores may produce the data in different ways. The 
sequence is therefore application-specific. 



3.5.3 Data Organization for 64-bit Ports 
RULE 3.85 

Data organization on 64-bit ports MUST conform to Figure 3-12. 
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64-bit Data Bus With 8-bit (BYTE) 


Granularity 








Address 


Active Portion of Data Bus 








Range : 

ADR I 
ADR 0 
{63. ~03) 


DAT I 
DAT 0 
{63.36) 


DAT I 
DAT 0 
(55.. "48) 


DAT I 
DAT 0 
(47.. "40) 


DAT I 
DAT 0 
{39.. "32) 


DAT I 
DAT 0 
(31. .^4) 


DAT I 
DAT 0 
(23.. 16) 


DAT I 
DAT 0 
(15. D8) 


DAT I 
DAT 0 
(07.. DO) 




Active 
Select 
Line ! 


SEL 1(7) 
SEL_0{7) 


SEL 1(6) 
SEL_0<6) 


SEL 1(5) 
SEL_0{5) 


SEL 1(4) 
SEL_0(4) 


SEL 1(3) 
SEL_0(3) 


SEL 1(2) 
SEL"0(2) 


SEL 1(1) 
SEL^O(l) 


SEL I{0) 
SEL_O{0) 




BIG 
END I AN 


BYTE ( 0 ) 


BYTE ( 1 ) 


BYTE (2) 


BYTE (3) 


BYTE (4) 


BYTE (5) 


BYTE (6) 


BYTE (7) 


BYTE 
Ordering 


LITTLE 
END I AN 


BYTE (7) 


BYTE (6) 


BYTE (5) 


BYTE (4) 


BYTE (3) 


BYTE (2) 


BYTE (1) 


BYTE (0) 










64-bit Data Bus With 16-bit {WORD} Granularity 




Address 


Active Portion of Data Bus 






Range 

ADR I 
i ADR 0 
{63. 702) 


DAT I 
DAT 0 
(63. 748) 


DAT I 
DAT 0 
(47. ."32) 


DAT I 
DAT"0 
(31.. "16) 


DAT I 
DAT 0 
(15. TOO) 




Active 
Select 
Line 


SEL 1(3) 
SEL_0(3) 


SEL 1(2) 
SEL~0{2) 


SEL 1(1) 
SEL"0(1) 


SEL 1(0) 
SEL"O(0) 




BIG 
END I AN 


WORD(0) 


W0RD{1) 


WORD (2) 


WORD (3) 


WORD 
Ordering 


LITTLE 
END I AN 


WORD (3) 


WORD (2) 


W0RD{1) 


WORD(0) 



P 64-bit Data Bus With 32-bit (DWORD) Granularity 




Address 
Range 

ADR I 
ADR O 
(63. Dl) 


Active Portion of Data Bus 


DAT I 
DAT*~0 
(63.32) 


DAT I 1 
DAT - O 
(31. "00) 


Active 
Select 
Line 


SEL 1(1) 
SEL_0{1) 


SEL 1(0) 
SEL~O(0) 


DWORD 
Ordering 


BIG 
END I AN 


DWORD (0) 


DWORD (1) 


LITTLE 
END I AN 


DWORD (1) 


DWORD (0) 



64-bit Data Bus With 64-bit (QWORD) Granularity 




Address 


Active Portion of Data Bus 




Range 

ADR I 
ADR - O 
(63. .DO) 


DAT I 
DAT O 
(63.. DO) 




Active 
Select 
Line 


SEL 1(0) 
SEL O(0) 


QWORD 
Ordering 


BIG 
END I AN 


QWORD (0) 


LITTLE 
END I AN 


QWORD (0) 



Fipure 3-12- Data organization for 6 4-bit ports. 
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3.5.4 Data Organization for 32-bit Ports 
RULE 3.90 

Data organization on 32-bit ports MUST conform to Figure 3-13. 



32-bit Data Bus With 8-bit (BYTE) Granularity 




Address 
Range 

ADR I 
ADR~0 
(63. .1)2) 


Active Portion of Data Bus 


DAT I 
DAT - 0 
{31.: "24) 


DAT I 
DAT - 0 
(23. .16) 


DAT I 
DAT~0 
(15.. 1)8) 


DAT I 
DAT~0 
(07 . .DO) 


Active 
Select 
Line 


SEL 1(3) 
SEL"0(3) 


SEL 1(2) 
SEL~0(2) 


SEL 1(1) 
SEL~0(1) 


SEL 1(0) 
SEL~O{0) 


BYTE 
Ordering 


BIG 
END IAN 


BYTE { 0 ) 
BYTE (4) 


BYTE { 1 ) ' 
BYTE (5) 


BYTE (2) 
BYTE (6) 


BYTE { 3 ) 
BYTE (7) 


LITTLE 
END IAN 


BYTE (3) 
BYTE (7) 


BYTE (2) 
BYTE (6) 


BYTE ( 1 ) 
BYTE (5) 


BYTE ( 0 ) 
BYTE (4) 



32-bit Data Bus With 16-bit (WORD) Granularity 




Address 
Range 

ADR I 
ADR 0 
(63.. Dl) 


Active Portion of Data Bus 


DAT I 
DAT 0 
(31.. 16) 


DAT I 
DAT 0 
(15.. DO) 


Active 
Select 
Line 


SEL 1(1) 
SEL^O(l) 


SEL 1(0) 
SEL~O(0) 


WORD 
Ordering 


BIG 
END IAN 


WORD(0) 
WORD (2) 


W0RD(1) 
WORD (3) 


LITTLE 
END I AN 


WORD(l) 
WORD (3) 


WORD{0) 
WORD (2) 



32-bit Data Bus With 32-bit (DWORD) Granularity 




Address 
Range 

ADR I 
ADR - 0 
(63. DO) 


Active Portion of Data Bus 


DAT I 
DAT"0 
(31. DO) 


Active 
Select 
Line 


SEL 1(0) 
SELJD{0) 


DWORD 
Ordering 


BIG 
END I AN 


DWORD (0) 
DWORD (1) 


LITTLE 
END I AN 


DWORD (0) 
DWORD (1) 



Figure 3-13. Data organization for 32-bit ports. 
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3.5.5 Data Organization for 16-bit Ports 
RULE 3.95 

Data organization on 16-bit ports MUST conform to Figure 3- 



16-bit Data Bus With 
8-bit (BYTE) Granularity 




Address 


Active Portion 
of Data Bus 


Range 

ADR I 
ADR 0 
(63.. Dl) 


DAT I 
DAT 0 
(15.. D8) 


DAT I 
DAT 0 
(07. .DO) 


Active 
Select 
Line 


SEL 1(1) 
SELJMD 


SEL 1(0) 
SELJD(O) 


BYTE 
Ordering 


BIG 
END IAN 


BYTE ( 0 j 
1 BYTE (2) 
BYTE (4) 
BYTE (6) 


BYTE ( 1 ) 
BYTE (3) 
BYTE (5) 
BYTE (7) 


LITTLE 
ENDIAN 


BYTE ( 1 ) 
BYTE (3) 
BYTE (5) 
BYTE (7) 


BYTE ( 0 ) 
BYTE (2) 
BYTE (4) 
BYTE (6) 



16-bit Date 
16-bit (WORD) 


i Bus With 
Granularity 




Address 


Active Portion 
of Data Bus 


Range 

ADR I 
ADR 0 
(63.. DO) 


DAT I 
DAT 0 
(15. DO) 


Active 
Select 
Line 


SEL 1(0) 
SEL_O(0) j 


WORD 
Ordering 


BIG 
ENDIAN 


WORD(0) 
WORD(l) 
WORD (2) 
WORD (3) 


LITTLE 
ENDIAN 


WORD(0) 
WORD(l) 
WORD (2) 
WORD (3) 



Figure 3-14. Data organi sation for 16-bit ports. 
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3.5.6 Data Organization for 8-bit Ports 



RULE 3.100 

Data organization on 8-bit ports MUST conform to Figure 3-15. 



8-bit Data Bus With 
8-bit (BYTE) Granularity 




Address 
Range 


Active 
Portion 
of 

Data Bus 




ADR I 
ADR 0 
(63. .DO) 


DAT I 
DAT 0 
(07. .DO) 




Active 
Select 
Line 


SEL 1(0) 
SEL_O(0) 


BYTE 


BIG 
END IAN 


BYTE ( 0 ) 
BYTE (1) 
BYTE (2) 
BYTE (3) 
BYTE (4) 
BYTE (5) 
BYTE (6) 
BYTE (7) 


Ordering 


LITTLE 
END I AN 


BYTE ( 0 ) 
BYTE ( 1 ) 
BYTE (2) 
BYTE ( 3 ) 
BYTE (4) 
BYTE (5) 
BYTE (6) 
BYTE (7) 



Figure 3-15. Data organization for 8-bit ports. 



3.6 References 

Cohen, Danny. On Holy Wars and a Plea for Peace. IEEE Computer Magazine . October 1981. 
Pages 49-54. [Description of BIG ENDIAN and LITTLE ENDIAN.] 
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Chapter 4 - Timing Specification 

The WISHBONE specification is designed to provide the end user with very simple timing ; con- 
straints AUhough the application specific circuit(s) will vary in this regard the interface tse If is 
Sedto work without need for detailed timing specification, In all cases the only t,m- 
fng information that is needed by the end user is the maximum ^fr^M 
that is oassed to a place & route tool. The maximum clock frequency is dictated by the time de 
C clock edge on [CLKJ] to the setup on a stage ^h- " log.cal 

signal path. This delay is shown graphically in Figure 4-1, and ,s defined as Tpd,clk-su. 



Tpd ; clk-su = 1/Fclk 



SIGNAL 



CLK I 



D 
CE 



INTERCONNECTION 
LjOGIC & ROUTE DELAY 



D 
CE 



Figure 4-1. Definition for Tpd.clk-su. 



tE ^ock^Put [CLK I] to each IP core MUST coordinate all activities for the internal logic 
witlin tfe WISHBONE interface. All WISHBONE output signals are registered at the Rising 
Tdge of [CLKJ]. All WISHBONE input signals must be stable before the rising edge of 
[CLKJ]. 

PERMISSION 4.00 

The user's place and route tool MAY be used to enforce this rule. 

SSS^™i^b -n be easily configured to enforce this rule. Generally, it only re- 
quires a single timing specification for Tpd,clk-su. 



The wfstlBONE interface MUST use synchronous, RTL design methodologies that, given 
n«nLly fast gate delays, will operate over a nearly infinite range of clock frequencies on 
[CLKJ]. 
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OBSERVATION 4.05 

Realistically, the WISHBONE interface will never be expected to operate over a nearly infinite 
frequency range. However this requirement eliminates the need for non-portable timing con- 
straints (that may work only on certain target devices). 



OBSERVATION 4.10 

The WISHBONE interface logic assumes that a low-skew clock distribution scheme is used on 
the target device, and that the clock-skew shall be low enough to permit reliable operation over 
the environmental conditions. 

4 

PERMISSION 4.05 

The IP core connected to a WISHBONE interface MAY include application specific timing re- 
quirements. 

RULE 4.10 

The clock input [CLKJ] MUST have a duty cycle that is no less than 40%, and no greater than 
60%. 



PERMISSION 4.10 

The SYSCON interface MAY use a variable clock generator. In these cases the clock frequency 
can be changed by the SYSCON interface so long as the clock edges remain clean and mono- 
tonic, and if the clock does not violate the duty cycle requirements. 



PERMISSION 4.15 

The SYSCON interface MAY use a gated clock generator. In these cases the clock shall be 
stopped in the low logic state. When the gated clock is stopped and started the clock edges must 
remain clean and monotonic. 



SUGGESTION 4.00 

When using a gated clock generator, turn the clock off when the WISHBONE interconnection is 
not busy. One way of doing this is to create a MASTER interface whose sole purpose is to ac- 
quire the WISHBONE interconnection and turn the clock off. This assures that the WISHBONE 
interconnection is not busy when gating the clock off. When the clock signal is restored the 
MASTER then releases the WISHBONE interconnection. 

OBSERVATION 4.15 

This specification does not attempt to govern the design of gated or variable clock generators. 
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SUGGESTION 4 10 

Design an IP core'so that all of the circuits (including the WISHBONE interconnect) follow the 
aforementioned RULEs, as this will make the core portable across a wide range of target devices 
and technologies. 



References 
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Appendix A - WISHBONE Tutorial 4 

By: Wade D. Peterson, Silicore Corporation 

The WISHBONE System-on-Chip (SoC) interconnection is a method for connecting digital cir- 
cuits together to form an integrated circuit 'chip'. This tutorial provides an introduction to the 
WISHBONE design philosophy and its practical applications. 

The WISHBONE architecture solves a very basic problem in integrated circuit design. That is, 
how to connect circuit functions together in a way that is simple, flexible and portable. The cir- 
cuit functions are generally provided as 'IP Cores' (Intellectual Property Cores), which system 
integrators can purchase or make themselves. 

Under this topology, IP Cores are the functional building blocks in the system. They are avail- 
able in a wide variety of functions such as microprocessors, serial ports, disk interfaces, network 
controllers and so forth. Generally, the IP cores are developed independently from each other 
and are tied together and tested by a third party system integrator. WISHBONE aides the system 
integrator by standardizing the IP Core interfaces. This makes it much easier to connect the 
cores, and therefore much easier to create a custom System-on-Chip. 



A.1 Am Introduction to WISHBONE 

WISHBONE uses a MASTER/SLAVE architecture. That means that functional modules with 
MASTER interfaces initiate data transactions to participating SLAVE interfaces. As shown in 
Figure A-l, the MASTERs and SLAVEs communicate through an interconnection interface 
called the INTERCON. The INTERCON is best thought of as a 'cloud' that contains circuits. 
These circuits allow MASTERs to communicate with SLAVEs. 

The term 'cloud' is borrowed from the telecom community. Oftentimes, telephone systems are 
modeled as clouds that represent a system of telephone switches and transmission devices. Tele- 
phone handsets are connected to the cloud, and are used to make phone calls. The cloud itself 
represents a network that carries a telephone call from one location to another. The activity go- 
ing on inside the cloud depends upon where the call is made and where it is going. For example, 
if a call is made to another office down the hall, then the cloud may represent a local telephone 
switch located in the same building. However, if the call is made across an ocean, then the cloud 
may represent a combination of copper, fiber optic and satellite transmission systems. 



4 This tutorial is not part of the WISHBONE specification. 
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Figure A-l. The WISHBONF. interconnection. 
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The cloud analogy is used because WISHBONE can be modeled in a similar way. MASTER 
and SLAVE interfaces (which are analogous to the telephones) communicate thorough an inter- 
connection (which is analogous to the telephone network 'cloud'). The WISHBONE intercon- 
nection network can be changed by the system integrator to suit his or her own needs. In 
WISHBONE terminology this is called a variable interconnection. 

Variable interconnection allows the system integrator to change the way in which the MASTER 
and SLAVE interfaces communicate with each other. For example, a pair of MASTER and 
SLAVE interfaces can communicate with point-to-point, data flow, shared bus or crossbar switch 
topologies. 

The variable interconnection scheme is very different from that used in traditional microcom- 
puter buses such as PCI, cPCI, VMEbus and ISA bus. Those systems use printed circuit back- 
planes with hardwired connectors. The interfaces on those buses can't be changed, which se- 
verely limits how microcomputer boards communicate with each other. WISHBONE overcomes 
this limitation by allowing the system integrator to change the system interconnection. 

This is possible because integrated circuit chips have interconnection paths that can be adjusted. 
These are very flexible, and take the form of logic gates and routing paths. These can be 'pro- 
grammed' into the chip using a variety of tools. For example, if the interconnection is described 
with a hardware description language like VHDL or Verilog®, then the system integrator has the 
ability to define and adjust the interconnection. Interconnection libraries can also be formed and 
shared. 

The WISHBONE interconnection itself is nothing more than a large, synchronous circuit. It 
must be designed to logically operate over a nearly infinite frequency range. However, every 
integrated circuit has physical characteristics that limit the maximum frequency of the circuit. In 
WISHBONE terminology this is called a variable timing specification. This means that a 
WISHBONE compatible circuit will theoretically function normally at any speed, but that it's 
maximum speed will always be limited by the process technology of the integrated circuit. 

At Silicore Corporation we generally define our WISHBONE interconnections using the VHDL 
hardware description language. These take the form of an ASCII file that contains the VHDL 
language instructions. This allows us to fully define our interconnections in a way that best fits 
the application. However, it also allows us to share our interconnections with others, who can 
adjust them to meet their own needs. It's important to note, though, that there's nothing magic 
about the interconnection itself. It's just a file, written with off-the-shelf tools, that fully de- 
scribes the hardware in the interconnection. 
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A 2 Types of WISHBONE Interconnection 



cores connect to each other, 
include: 

• Point-to-point 

• Data flow 



Shared bus 
Crossbar switch 



integrated circuits can be connected using a point-to-point interconnection. 

The WISHBONE specification does not dictate how any of, fc~ « < ^plemented by ^ system 
integrator. That's because the interconnection itself is a ^ or create their 

TERCON. The system integrator can use or modify an off-the-shelf INTLKLUN, 



own. 



A 2 1 Point-to-point Interconnection 

gether. As shown in Figure A 2, the _po m p MASTER interface could be 



WISHBONE 




WISHBONE 


MASTER 


pd 


SIAVE 



Fi pnre A-2. T he point-to-P ™* interconnection. 



A 2 2 Data Flow Interconnection 

rerface. Da* . flows from core-.o-core. Sometimes this process ,s called p^tormg. 
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IP CORE 'C f 


WISHBONE 
SIAVE 
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,x, 


WISHBONE 
SIAVE 

WISHBONE 
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WISHBONE 
SIAVE 

WISHBONE 
MASTER 



DIRECTION OF DATA FLOW 



Figure A-3. The data flow interconnection. 

The data flow architecture exploits parallelism, thereby speeding up execution time. For exam- 
ple, if each of the IP cores in the Figure represents a floating point processor, then the system has 
three times the number crunching potential of a single unit. This assumes, of course, that each IP 
Core takes an equal amount of time to solve its problem, and that the problem can be solved in a 
sequential manner. In actual practice this may or may not be true, but it does illustrate how the 
data flow architecture can provide a high degree of parallelism when solving problems. 



A.2.3 Shared Bus Interconnection 

The shared bus interconnection is useful for connecting two or more MASTERS with one or 
more SLAVEs. A block diagram is shown in Figure A-4. In this topology a MASTER initiates 
a bus cycle to a target SLAVE. The target SLAVE then participates in one or more bus cycles 
with the MASTER. 

An arbiter (not shown in the Figure) determines when a MASTER may gain access to the shared 
bus. The arbiter acts like a 'traffic cop' to determine when and how each MASTER accesses the 
shared resource. Also, the type of arbiter is completely defined by the system integrator. For 
example, the shared bus can use a priority or a round robin arbiter. These grant the shared bus 
on a priority or equal basis, respectively. 

The main advantage to this technique is that shared interconnection systems are relatively com- 
pact. Generally, it requires fewer logic gates and routing resources than other configurations, 
especially the crossbar switch. Its main disadvantage is that MASTERS may have to wait before 
gaining access to the interconnection. This degrades the overall speed at which a MASTER may 
transfer data. 
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SLAVE 
'SB' 



WISHBONE 
SLAVE 
'SC 



Figure A-4 Shared bus interconnection. 

The WISHBONE specification does no, ^^*^^S^<^ 
work better with three-state buses. 

cases this is done with three-state buses. 



A.2.4 Crossbar Switch Interconnection 

The crossbar switch interconnection is used when ^"^^Xg,™™ £f£ 
£ SUAVE. A-arhiter (no^^ 

^ n,ll:l" A ER E t o "so ^connection (as ,on g as two MASTERS don, access 
the same SLAVE at the same time). 
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Figure A-5. Crossbar switch interconnection. 



Under this method, each master arbitrates for a 'channel 5 on the switch. Once this is established, 
data is transferred between the MASTER and the SLAVE over a private communication link. 
The Figure shows two possible channels that may appear on the switch. The first connects 
MASTER 'MA' to SLAVE 'SB'. The second connects MASTER 'MB' to SLAVE 'SA\ 

The overall data transfer rate of the crossbar switch is higher than shared bus mechanisms. For 
example, the figure shows two MASTER/SLAVE pairs interconnected at the same time. If each 
communication channel supports a data rate of 100 Mbyte/sec, then the two data pairs would op- 
erate in parallel at 200 Mbyte/sec. This scheme can be expanded to support extremely high data 
transfer rates. 

One disadvantage of the crossbar switch is that it requires more interconnection logic and routing 
resources than shared bus systems. As a rule-of-thumb, a crossbar switch with two MASTERS 
and two SLAVEs takes twice as much interconnection logic as a similar shared bus system (with 
two MASTERS and two SLAVEs). 

The crossbar interconnection is a typical configuration that one might find in microcomputer 
buses like 5 RACEway, SKY Channel or Myrinet. 



5 Raceway: AN S WIT A 5-1994. SKYchannel: ANSI/VITA 10-1995. Myrinet: ANSI/VITA 26-1998. For more 
information about these standards see www.vita.com . 
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A.3 The WISHBONE Interface Signals 

WISHBONE MASTER and SLAVE interfaces can be connected together in a number of ways 
This requires that WISHBONE interface signals and bus cycles be designed in a very flex.ble 
and reusable manner. The signals were defined with the following requirements: 

. The signals allow MASTER and SLAVE interfaces to support point-to-point, data flow, 
shared bus and crossbar switch interconnections. 

. The signals allow three basic types of bus cycle. These include SINGLE READ/WRITE 
BLOCK READ/WRITE and RMW (read-modify-write) bus cycles. The operation ot 
these bus cycles are described below. 

. A handshaking mechanism is used so that either the MASTER or ^ Partichpating 
SLAVE interface can adjust the data transfer rate during a bus ^ Th « dlows Ae 
speed of each bus cycle (or phase) to be adjusted by either the MASTER or the SLAVE 
interface. This means that all WISHBONE bus cycles run at the speed of the slowest in- 
terface. 

. The handshaking mechanism allows a participating SLAVE to accept a data transfer re- 
ject a data transfer with an error or ask the MASTER to retry a bus cycle. The SLAVE 
does this by generating the [ACKO], [ERR_0] or [RTYO] signals respectively Every 
interface must support the [ACK_0] signal, but the error and retry acknowledgement 
signals are optional. 

. All signals on MASTER and SLAVE interfaces are either inputs or outputs but are never 
^directional (i.e. three-state). This is because some FPGA and ASIC devices do no 
support bi-directional signals. However, it is permissible (and sometimes advantageous) 
to use bi-directional signals in the interconnection logic if the target device supports it. 

. Address and data bus widths can be altered to fit the application. 8, 16, 32 and 64-bit 
data buses, and 0-64 bit address buses are defined. 

. As shown in Figure A-6, all signals are arranged so that MASTER, SLAVE and 
SYSCON interfaces can be connected directly together to form a simple ■ P^*-P°£ 
interface. This allows very compact and efficient WISHBONE interfaces to be bu.lt. For 
example WISHBONE could be used as the external system bus on a microprocessor IF 
Core However, it's efficient enough so that it can be used for internal buses inside of the 
microprocessor. 

. User defined signals in the form of 'tags' are allowed. This allows the system ^ integrator 
to add special purpose signals to each WISHBONE interface. For example, the system 
integrator could add a parity bit to the address or data buses. 

A comprehensive list of the WISHBONE signals and their descriptions is given in the specify 



tion. 
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SYSCON 
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(A) FORMING A POINT-TO-POINT INTERCONNECTION . 
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(B) POINT-TO-POINT INTERCONNECTION . 



Figure A-6. The WISHBONE signals are selected to permit a MASTER, SLAVE and SYSCON 
interface to be directly connected, thereby forming a simple point-to-point interface. 
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A.4 The WISHBONE Bus Cycles 

There are three types of defined WISHBONE bus cycles. They 

. SINGLE READ/WRITE 
. BLOCK READ/WRITE 
• READ MODIFY WRITE (RMW) 



A.4.1 SINGLE READ/WRITE Cycle 

The SINGLE READ/WRITE is the most basic WISHBONE bus ^^^X^ " 
is used to transfer a single data operand. Figure A-7 shows a typical SINGLE READ cycle. 

The WISHBONE specification shows all bus cycle timing diagrams as if the MASTER and 
It AVE interfaces were connected in a point-to-point configuration. They also show all of he 
SLAVE interlaces were ^onne * ^ provides a standard way of describing the 

READ cycle operates thusly: 

1 In response to clock edge 0, the MASTER interface asserts [ADR_O 0 ], [WEO], [SELO], 
[STB_0] and [CYC_0]. 

0 Th, ST AVE decodes the bus cycle by monitoring its [STBJ] and address inputs, and pres- 
tion, the SLAVE [DAT_0()1 signals are connected to the MASTER [DAT_I()J signals. 

1 ThP ST AVE indicates that it has placed valid data on the data bus by asserting the MAS- 
TER s IAGK if acknowledge signal. Also note that the SLAVE may delay its response by 
inSt n £ne o more wait Sates 8 In this case, the SLAVE does not assert the acknowledge 
line The possibility of a wait state in the timing diagrams is indicated by -WSS- . 

4. The MASTER monitors the state of its [ACK_I] line, and determines that the SLAVE has 
acknowledged the transfer at clock edge 1. 

5. The MASTER latches [DAT_1()] and negates its [STB_0] signal in response to [ACKJ]. 



latched the data. 
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Figure A-7. SINGLE READ cycle. 
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A.4.2 BLOCK READ/WRITE Cycle 

The BLOCK READ/WRITE e y e,es are very simi,ar to .he 

di orK cvc ies can be thought of as two or more back-to-back SINGLE cycles strung iogc 
ta^S^^mtaology the term cycle refers to the whole BLOCK cycle. The small, md, 
vidual data transfers that make up the BLOCK cycle are Cephases. 

■ * ~f+u„ m nrK rvcles are identified by the assertion and negation 
The starting and stopping point of he BLOCK cycles are menu y ^ 

of the MASTER [CYC_0] signal (respectively). The [CYC_0] ^» "J? ™ wish es to use 
and crossbar interconnections because it informs system logic that the MASTER wishes 
^ bus It a so informs the interconnection when it is through with its bus cycle. 



A.4.3 READ-MODIFY- WRITE (RMW) Cycle 

tw RF AD MODIFY-WR1TE cycle is used in multiprocessor and multitasking systems. This 
SSJ^SKSU software processes to share common resources jby using sema 
nhnres This is commonly done on interfaces for disk controllers, senal ports and memory As 
phores. This s commonly WRITE , reads and writ es data to a memory loca- 

aruassos*B*sjW«« 

BLOCK cycle where the first phase is a read and the second phase is a write. 

The RFAD MODIFY-WRITE cycle is also known as an indivisible cycle because it is designed 

signal is always asserted for the duration of the RMW cycle. 

To illustrate this point, consider a two processor system with a single disk "J^^*^. 

Sle=^^^ 
then the disk controller is busy. 

' ' '. ♦ f«rrr, r,f Hkk 'thrashine' In this case, if both processors were al- 
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Now consider a system where the two processors both need to use the disk. We'll call them 
processor #0 and processor #1. In order for processor #0 to acquire the disk it first reads and 
stores the state of the semaphore bit, and then sets the bit by writing back to memory. The read- 
ing and setting of the bit takes place inside of a single RMW cycle. 

Once the processor is done with the semaphore operation, it checks the state of the bit it read 
during the first phase of the RMW cycle. If the bit is clear it goes ahead and uses the disk con- 
troller. If the other processor attempts to use the disk controller at this time, it reads a 'V from 
the semaphore, thereby preventing it from accessing the disk controller. When the first processor 
(#0) is done with the disk controller, it simply clears the semaphore bit by writing a 6 0' to it. 
This allows the other processor to gain access to the controller the next time it checks the sema- 
phore. 

Now consider the same situation, except where the semaphore is set and cleared using a SINGLE 
READ cycle followed by a SINGLE WRITE cycle. In this case it is possible for both processors 
to gain access to the disk controller at the same time... a situation that would crash the system. 
That's because the arbiter can grant the bus in the following order: 

o Processor #0 reads '0' from the semaphore bit. 

o Processor #1 reads '0' from the semaphore bit. 

o Processor #0 writes 4 V to the semaphore bit. 

o Processor # 1 writes 6 T to the semaphore bit. 

This leads to a system crash because both processors read a '0' from the semaphore bit, thereby 
causing both to access the disk controller. 

It is important to note that a processor (or other device connected to the MASTER interface) 
must support the RMW cycle in order for this to be effective. This is generally done with special 
instructions that force a RMW bus cycle. Not all processors do this. A good example is the 
680XX family of microprocessors. These use special compare-and-set (CAS) and test-and-set 
(TAS) instructions to generate RMW cycles, and to do the semaphore operations. 



A.5 Eedian 

The WISHBONE specification regulates the ordering of data. This is because data can be pre- 
sented in two different ways. In the first way, the most significant byte of an operand is placed 
at the higher (bigger) address. In the second way, the most significant byte of an operand can be 
placed at the lower (smaller) address. These are called BIG ENDIAN and LITTLE END1AN 
data operands, respectively. WISHBONE supports both types. 

ENDIAN becomes an issue when the granularity of a WISHBONE port is smaller than the oper- 
and size. For example, a 32-bit port can have an 8-bit (BYTE wide) granularity. This results in 
a fairly ambiguous situation where the most significant byte of the 32-bit operand could be 
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placed at the higher or lower byte address of the port. However, ENDIAN is not an issue when 
the granularity and port size are the same. 

The system integrator may wish to connect a BIG ENDIAN interface to a LITTLE ENDIAN in- 
teface. In many cases the conversion is quite straightforward, and does not require any exotic 
conversion logic. Furthermore, the conversion does not create any speed degradation in the in- 
terface. In general, the ENDIAN conversion takes place by renaming the data and select I/O sig- 
nals at a MASTER or SLAVE interface. 

Figure A-8 shows a simple example where a 32-bit BIG ENDIAN MASTER output (CORE 'A') 
is connected to a 32-bit LITTLE ENDIAN SLAVE input (CORE 'B'). Both interfaces have 32- 
bit operand sizes and 8-bit granularities. As can be seen in the diagram, the ENDIAN conversion 
is accomplished by cross coupling the data and select signal arrays. This is quite simple since 
the conversion is accomplished at the interconnection level, or using a wrapper. This is espe- 
cially simple in soft IP cores using VHDL or Verilog® hardware description languages, as it 
only requires the renaming of signals. 

In some cases the address lines may also need to be modified between the two cores. For exam- 
ple, if 64-bit operands are transferred between two cores with 8-bit port sizes, then the address 
lines may need to be modified as well. 



CORE 'A' 
MASTER OUTPUT 
BIG ENDIAN 




CORE 'B' 
SLAVE INPUT 
LITTLE ENDIAN 



Figure A-8. BIG ENDIAN to LITTLE ENDIAN conv ersion example. 
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A.6 SLAVE I/O Port Examples 

In this section we'll investigate several examples of WISHBONE interface for SLAVE I/O ports. 
Our purpose is to: 

o Show some simple examples of how the WISHBONE interface operates. 

o Demonstrate how simple interfaces work in conjunction with standard logic primitives on 

FPGA and ASIC devices. This also means that very little logic is needed to implement the 

WISHBONE interface, 
o Demonstrate the concept of granularity, 
o Provide some portable design examples, 
o Give examples of the WISHBONE DATASHEET, 
o Show VHDL implementation examples. 



A.6.1 Simple 8-bit SLAVE Output Port 

Figure A-9 shows a simple 8-bit WISHBONE SLAVE output port. The entire interface is im- 
plemented with a standard 8-bit 'D-type' flip-flop register (with synchronous reset) and a single 
AND gate. During write cycles, data is presented at the data input bus [DAT_I(7..0)] 9 and is 
latched at the rising edge of [CLK_I] when [STB J] and [WE_I] are both asserted. 



QJ 
4-1 

u 

0) 



DAT 0(7.. 0) <J- 



DAT_I(7..0) — 

ack o <y 



STB I 
WE~1 



RST_I 
CLK I 



D Q 



CE 



RESET 
> 



8-BIT 
D-TYPE 
REGISTER 



-> PRT 0(7.. 0) 



Figure A-9. Simple 8-bit WISHBONE SLAVE output port. 



The state of the output port can be monitored by a MASTER by routing the output data lines 
back to [DAT_O(7..0)]. During read cycles the AND gate prevents erroneous data from being 
latched into the register. 

This circuit is highly portable, as all FPGA and ASIC target devices support D-type flip-flops 
with clock enable and synchronous reset inputs. 
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The circuit also demonstrates how the WISHBONE interface requires little or no logic overhead. 
In this case, the WISHBONE interface does not require any extra logic gates whatsoever. This is 
because WISHBONE is designed to work in conjunction with standard, synchronous and combi- 
natorial logic primitives that are available on most FPGA and ASIC devices. 

The WISHBONE specification requires that the interface be documented. This is done in the 
form of the WISHBONE DATASHEET. The standard does not specify the form of the da- 
tasheet. For example, it can be part of a comment field in a VHDL or Verilog® source file or 
part of a technical reference manual for the IP core. Table A-l shows one form of the WISH- 
BONE DATASHEET for the 8-bit output port circuit. 

The purpose of the WISHBONE DATASHEET is to promote design reuse. It forces the origi- 
nator of the IP core to document how the interface operates. This makes it easier for another 
person to re-use the core. 



Table A-l. WISHBONE DATASHEET for the 8-bit output port example. 


Description 


Specification 


General description: 


8-bit SLAVE output port. 


Supported cycles: 


SLAVE, READ/WRITE 
SLAVE, BLOCK READ/WRITE 
SLAVE, RMW 


Data port, size: 

Data port, granularity: 

Data port, maximum operand size: 

Data transfer ordering: 

Data transfer sequencing: 


8-bit 
8-bit 
8-bit 

Big endian and/or little endian 
Undefined 


Supported signal list and cross reference 
to equivalent WISHBONE signals: 


Sianal Name WISHBONE Equiv. 
ACK 0 ACK_0 
CLK I CLK_I 
DAT I(7..0) DAT I() 
DAT O(7..0) DAT 0() 
RST I RSTJ 
STB I STB I 
WE I WE I 



Figure A-10 shows a VHDL implementation of same circuit. The WBOPRT08 entity imple- 
ments the all of the functions shown in the schematic diagram of Figure A-9. 



WISHBONE SoC Architecture Specification, Revision B.2 



80 



library i'eee; 

use ieee . std_logic_1164 . all ; 

entity WBOPRT08 is 
port ( 

-- WISHBONE SLAVE interface: 



ACK_0 


out 


std_logic; 








CLK_I 


in 


std_logic; 








DAT_I 


in 


std_logic_vector ( 


7 


downto 


0 ) ; 


DAT_0 


out 


std_logic_vector ( 


7 


downto 


0 ) ; 


RST_I 


in 


std_logic; 








STB_I 


in 


std_logic; 








WE_I : 


in 


std_logic; 








-- Output port 


(non- WISHBONE signals 


0 






PRT_0 


out 


std_logic_vector ( 


7 


downto 


0 ) 



); 

end entity WBOPRT08 ; 

architecture WBOPRT081 of WBOPRT08 is 



signal Q: std_logic_vector ( 7 downto 0 ); 
begin 

REG: process ( CLK_I ) 
begin 

if( rising_edge( CLK_I ) ) then 

if( RST_I = f l' ) then 

Q <= B M OO00O0OO n ; 
elsif ( (STB_I and WE_I) = '1' ) then 

Q <= DAT_I ( 7 downto 0 ) ; 

else 

Q <= Q; 
end if; 



end if; 



end process REG; 

ACK_0 <= STB_I; 
DAT_0 <= Q; 
PRT 0 <= Q; 



end architecture WBOPRT081 ; 



Figure A-10, VHDL implementation of the 8-bit output port interface. 
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A.6.2 Simple 16-bit SLAVE Output Port With 16-bit Granularity 



Figure A-ll shows a simple 16-bit WISHBONE SLAVE output port. Table A-2 shows the 
WISHBONE DATASHEET for this design. It is identical to the 8-bit port shown earlier, except 
that the data bus is wider. Also, this port has 16-bit granularity. In the next section, it will be 
compared to a 16-bit port with 8-bit granularity. 



4-1 

U 



DAT 0(15.. 0) «4 



DAT K15..0) 




► PRT 0(15.. 0) 



Figure A-ll. Simple 16-bit WISHBONE SLAVE output port with 16-bit granularity 



Table A-2. WISHBONE DATASHEET 
for the 16-bit output port with 16-bit granularity. 


Description 


Specification 


General description: 


16-bit SLAVE output port. 


Supported cycles: 


SLAVE, READ/WRITE 
SLAVE, BLOCK READ/WRITE 
SLAVE, RMW 


Data port, size: 

Data port, granularity: 

Data port, maximum operend size: 

Data transfer ordering: 

Data transfer sequencing: 


16-bit 
16-bit 
16-bit 

Big endian and/or little endian 

Undefined i 


Supported signal list and cross reference 
to equivalent WISHBONE signals: 


Signal Name WISHBONE Equiv. 
ACK 0 ACK 0 
CLK I CLK I 
DAT 1(15.-0) DAT I() 
DAT 0(1 5. .0) DAT_0() 
RST I RSTJ 
STB I STBJ 
WE I WE I 
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A.6.3 Simple 16-bit SLAVE Output Port With 8-bit Granularity 



Figure A- 12 shows a simple 16-bit WISHBONE SLAVE output port. This port has 8-bit granu- 
larity, which means that data can be transferred 8 or 16-bits at a time. 



M-l 



DAT 0(15.. 0) <- 



DATI(15..0) — 
ACKO <h 
STB_I — 
WE_I — 
SELI(l)- — 
RST_I — 
OKI — 
SEL 1(0) — 



16 



15. .8 




CE 

RESET 
> 



7..0 



CE 



RESET 
t> 



DUAL 
8-BIT 
D-TYPE 
REGISTERS 



-> PRT 0(15.. 0) 



Figure A-12. Simple 16-bit WISHBONE SLAVE output port with 8-bit granularity. 

This circuit differs from the aforementioned 16-bit port because it has 8-bit granularity. This 
means that the 16-bit register can be accessed with either 8 or 16-bit bus cycles. This is accom- 
plished by selecting the high or low byte of data with the select lines [SEL_I(1..0)]. When 
[SELJ(0)] is asserted, the low byte is accessed. When [SEL_I(1)] is asserted, the high byte is 
accessed. When both are asserted, the entire 16-bit word is accessed. 

The circuit also demonstrates the proper use of the [STB J] and [SEL_I()] lines for slave de- 
vices. The [STB_I] signal operates much like a chip select signal, where the interface is selected 
when [STB J] is asserted. The [SEL_I()] lines are only used to determine where data is placed 
by the MASTER or SLAVE during read and write cycles. 

In general, the [SEL_I()] signals should never be used by the SLAVE to determine when the IP 
core is being accessed by a master. They should only be used to determine where data is placed 
on the data input and output buses. Stated another way, the MASTER will assert the select sig- 
nals [SEL_0()] during every bus cycle, but a particular slave is only accessed when it monitors 
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that its [STBJ] input is asserted. Stated another way, the [STBJ] signal is generated by address 
decode logic within the WISHBONE interconnect, but the [SEL_I()] signals may be broadcasted 
to all SLAVE devices. 

Table A-3 shows the WISHBONE DATASHEET for this IP core. This is very similar to the 16- 
bit data port with 16-bit granularity, except that the granularity has been changed to 8-bits. 

It should also be noted that the datasheet specifies that the circuit will work with READ/WRITE, 
BLOCK READ/WRITE and RMW cycles. This means that the circuit will operate normally 
when presented with these cycles. It is left as an exercise for the user to verify that the circuit 
will indeed work with all three of these cycles. 



Table A-3. WISHBONE DATASHEET 
for the 16-bit output port with 8-bit granularity. 


! Description 


Specification 


General description: 


16-bit SLAVE output port with 8-bit 
granularity. 


Supported cycles: 


SLAVE, READ/WRITE 
SLAVE, BLOCK READ/WRITE 
SLAVE, RMW 


Data port, size: 

Data port, granularity: 

Data port, maximum operand size: 

Data transfer ordering: 

Data transfer sequencing: 


16-bit 

8-bit 

16-bit 

Big endian and/or little endian 
Undefined 


Supported signal list and cross reference 
to equivalent WISHBONE signals: 


Sienal Name WISHBONE Eauiv. 
ACK 0 ACK_0 
CLK I CLK_I 
DAT 1(15..0) DAT_I() 
DAT O(15..0) DAT_0() 
RST I RSTJ 
STB I STB I 
WE I WE I 



Figure A- 13 shows a VHDL implementation of same circuit. The WBOPRT16 entity imple- 
ments the all of the functions shown in the schematic diagram of Figure A-12. 
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entity WB0PRT16 is 
port ( 

-- WISHBONE SLAVE interface: 



ACK_0 


out 


std_logic; 








CLK_I 


in 


std_logic; 








DAT_I 


: in 


std logic_vector ( 15 


downto 


0 


) ; 


DAT_0 


: out 


std_logic_vector ( 15 


downto 


0 


) ; 


RST_I 


: in 


std_logic; 








SEL_I 


: in 


std_logic_vector ( 1 


downto 


0 


>; 


STB_I 


: in 


std_logic; 








WE_I : 


in 


std_logic; 








-- Output port 


(non-WISHBONE signals) : 








PRT_0 


: out 


std logic_vector ( 15 


downto 


0 


) 



) ; 

end entity WB0PRT16; 

architecture WB0PRT161 of WB0PRT16 is 

signal QH: std_logic_vector ( 7 downto 0 ); 
signal QL: std_logic_vector ( 7 downto 0 ); 

begin 



REG: process ( CLK_I ) 
begin 

if ( rising_edge ( CLK_I ) ) then 
if( RST_I = '1' ) then 

QH <= B M 00000000 n ; 
elsif( (STB_I and WE_I and SEL_I(D) = '1' ) then 

QH <= DAT_I ( 15 downto 8 ) ; 

else 

QH <= QH; 
end if; 
end if; 

if ( rising_edge{ CLK_I ) ) then 
if( RST_I = '1' ) then 

QL <= B"00000000" ; 
elsif ( (STB_I and WE_I and SEL_I(0)) = ! 1' ) then 

QL <= DAT_I ( 7 downto 0 ) ; 

else 

QL <= QL; 
end if; 
end if ; 



end process REG; 



ACK 0 <= STB_I; 



DAT_ 


0( 


15 


downto 


8 


) 


<= 


QH 


DAT_ 


0( 


7 


downto 


0 


) 


<= 


QL 


prt" 


0( 


15 


downto 


8 


) 


<= 


QH 


PRT~ 


0( 


7 


downto 


0 


) 


<= 


QL 



end architecture WB0PRT161 ; 



Figure A-13. VHDL implementation of the 16-bit output port with 8-bit granularity. 
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A.7 WISHBONE Memory Interfacing 

In this section we'll examine WISHBONE memory interfacing and present some examples. The 
purpose of this section is to: 

• Introduce the FASM synchronous RAM and ROM models. 

• Show a simple example of how the WISHBONE interface operates. 

• Demonstrate how simple interfaces work in conjunction with standard logic primitives on 
FPGA and ASIC devices. This also means that very little logic (if any) is needed to im- 
plement the WISHBONE interface. 

• Present a WISHBONE DATASHEET example for a memory element. 

• Describe portability issues with regard to FPGA and ASIC memory elements. 

A.7.1 FASM Synchronous RAM and ROM Model 

The WISHBONE interface can be connected to any type of RAM or ROM memory. However, 
some types will be faster and more efficient than others. If the memory interface closely resem- 
bles the WISHBONE interface, then everything will run very fast. If the memory is significantly 
different than WISHBONE, then everything will slow down. This is such a fundamental and 
important issue that both the WISHBONE interface and its bus cycles were designed around the 
most common memory interface that could be found. 

This was very problematic in the original WISHBONE concept. That's because there are very 
few portable RAM and ROM types used across all both FPGA and ASIC devices. In fact, the 
most common memory type that could be found are what we call 'FASM', or the FPGA and 
ASIC Subset Model 7 . 

The FASM synchronous RAM model conforms to the connection and timing diagram shown in 
Figure A-14. The WISHBONE bus cycles all are designed to interface directly to this type of 
RAM. During write cycles, FASM Synchronous RAM stores input data at the indicated address 
whenever: (a) the write enable (WE) input is asserted, and (b) there is a rising clock edge. 

During read cycles, FASM Synchronous RAM works like an asynchronous ROM. Data is 
fetched from the address indicated by the ADR() inputs, and is presented at the data output 
(DOUT). The clock input is ignored. However, during write cycles, the output data is updated 
immediately during a write cycle. 

A good exercise for the user is to compare the FASM Synchronous RAM cycles to the WISH- 
BONE SINGLE READ/WRITE, BLOCK READ/WRITE and READ-MODIFY-WRITE cycles. 
You will find that these three bus cycles operate in an identical fashion to the FASM Synchro- 
nous RAM model. They are so close, in fact, that FASM RAMs can be interfaced to WISH- 
BONE with as little as one NAND gate. 



7 The original FASM model actually encompasses many type of devices, but in this tutorial we'll focus only on 
FASM synchronous RAM and ROM models. 
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While most FPGA and ASIC devices will provide RAM that follows the FASM guidelines, you 
will probably find that most devices also support other types of memories as well. For example, 
in some brands of FPGA you will find block memories that use a different interface. Some of 
these will still interface very smoothly to WISHBONE, while others will introduce a wait-state. 
In all cases that we found, all FPGAs and most ASICs did support at least one style of FASM 
memory. 





DIN DOUT 
ADR 

WE RAM 
> 


► 


► 




> 


> 





SYNCHRONOUS 
WRITE CYCLE 



CLK 



ADRO TO^ T^^ 



duo- XX3$>Cj^ 



3« 



ASYNCHRONOUS 
READ CYCLE 



WE 



DINO ffifflfflffi 

ADRO 



Figure A-14. Generic FASM synchronous RAM connection and timing diagram. 



The FASM ROM closely resembles the FASM RAM during its read cycle. FASM ROM con- 
forms to the connection and timing diagram shown in Figure A-15. 
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Figure A-15. FASM asynchronous ROM connection and ti ming diagram. 



A.7.2 Simple 16 x 8-bit SLAVE Memory Interface 

Figure A- 16 shows a simple 8-bit WISHBONE memory. The 16 x 8-bit memory is formed from 
two 16 x 4-bit FASM compatible synchronous memories. Besides the memory elements, the en- 
tire interface is implemented with a single AND gate. During write cycles, data is presented at 
the data input bus [DAT_I(7..0)], and is latched at the rising edge of [CLK_I] when [STBJ] and 
[WE I] are both asserted. During read cycles, the memory output data (DO) is made available at 
the data output port [DAT_O(7..0)]. 



WISHBONE SoC Architecture Specification, Revision B.2 



88 



3 



CO 

I— I 



DAT 0(7.. 0) < 



DAT_I(7..0) 
ADR_I(3..0) 

ACK_0 < 

CLK I 




Figure A- 16. Simple 16 x 8-bit SLAVE memory. 

The memory circuit does not have a reset input. That's because most RAM memories do not 
have a reset capability. 

The circuit also demonstrates how the WISHBONE interface requires little or no logic overhead. 
In this case, the WISHBONE interface requires a single AND gate. This is because WISH- 
BONE is designed to work in conjunction with standard, synchronous and combinatorial logic 
primitives that are available on most FPGA and ASIC devices. 

The WISHBONE specification requires that the interface be documented. This is done in the 
form of the WISHBONE DATASHEET. The standard does not specify the form of the da- 
tasheet. For example, it can be part of a comment field in a VHDL or Verilog® source file or 
part of a technical reference manual for the IP core. Table A-4 shows one form of the WISH- 
BONE DATASHEET for the 16 x 8-bit memory IP core. 

The purpose of the WISHBONE DATASHEET is to promote design reuse. It forces the origi- 
nator of the IP core to document how the interface operates. This makes it easier for another 
person to re-use the core. 
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t„ki« a. d WISHRONE DATASHEET for the 16 x 8-bit SLAVE memory. 


Description 


Specification 1 


C^,or\(*rck\ HAcr»rintinn * 


16 x 8-bit memory IP core. 


Supported cycles: 


SLAVE, READ/WRITE 
SLAVE, BLOCK READ/WRITE 
SLAVE, RMW 


Data port, size: 

Data port, granularity: 

Data port, maximum operand size: 

Data transfer ordering: 

Data transfer sequencing: 


8-bit 
8-bit 
8-bit 

Big endian and/or little endian 
Undefined 


L-lociC frequency cont>ir<Hiiti>. 


NONE (determined by memory primitive) 


Supported signal list and cross reference 
to equivalent WISHBONE signals: 


Sisnal Name WISHBONE Eauiv. 

ACK 0 ACK_0 
ADR 1(3.-0) ADRJO 
CLK I CLKJ 
DAT I(7..0) DATJO 
DAT O(7..0) DAT 0() 
STB I STBJ 
WE I WE I 


Special requirements: 


Circuit assumes the use of synchronous 
RAM primitives with asynchronous read 
capability. 



A.7.3 Memory Primitives and the [ACK_0] Signal 

Memory primitives, specific to the FPGA or ASIC target device, are usually used for the RAM 
storage elements. That's because most high-level languages (such as VHDL and Venlog®) 
don't synthesize these very efficiently. For this reason, the user should verify that the memory 
primitives are available for the target device. 

The memory circuits shown above are highly portable, but do assume that FASM compatible 
memories are available. During write cycles, most synchronous RAM primitives latch the input 
data when at the rising clock edge when the write enable input is asserted. However, during read 
cycles the RAM primitives may behave in different ways. 

There are two types of RAM primitives that are generally found on FPGA and ASIC devices: (a) 
those that synchronously present data at the output after the rising edge of the clock input and 
(b) those that asynchronously present data at the output after the address is presented to the RAM 
element. 

The circuit assumes that the RAM primitive is the FASM, asynchronous read type. That's be- 
cause during read cycles the WISHBONE interface assumes that output data is valid at the rising 
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[CLK_I] edge following the assertion of the [ACK_0] output. Since the circuit ties the [STBJ] 
signal back to the [ACK_0] signal, the asynchronous read RAM is needed on the circuit shown 
here. 

For this reason, if synchronous read type RAM primitives are used, then the circuit must be 
modified to insert a single wait-state during all read cycles. This is quite simple to do, and only 
requires an additional flip-flop and gate in the [ACK_0] circuit. 

Furthermore, it can be seen that the circuit operates faster if the asynchronous read type RAM 
primitives are used. That's because the [ACK_0] signal can be asserted immediately after the 
assertion of [STB_I]. If the synchronous read types are used, then a single-clock wait-state must 
be added. 



A.8 Point-to-point Interconnection Example 

Now that we've reviewed some of the WISHBONE basics, it's time to try them out with a sim- 
ple example. In this section we'll create a complete WISHBONE system with a point-to-point 
interconnection. The system includes a 32-bit MASTER interface to a DMA 8 unit, and a 32-bit 
SLAVE interface to a memory. In this example the DMA transfers data to and from the memory 
using block transfer cycles. 

The purpose of this system is to create a portable benchmarking device. Although the system is 
very simple, it does allow the user to determine the typical maximum speeds and minimum sizes 
on any given FPGA or ASIC target device 9 . 

Source code for this example can be found in the WISHBONE Public Domain Library for 
VHDL (under ' EXAMPLE 1' in the EXAMPLES folder). The library also has detailed docu- 
mentation for the library modules, including detailed circuit descriptions and timing diagrams for 
the INTERCON, SYSCON, DMA and memory interfaces. The reader is encouraged to review 
and experiment with all of these files. 

Figure A- 17 shows the system. The WISHBONE interconnection system (INTERCON) can be 
found in file ICNOOOla. That system connects a simple DMA MASTER (DMAOOOla) to an 8 x 
32-bit register based memory SLAVE (MEM0002a). The reset and clock signals are generated 
by system controller SYSCON (SYCOOOla). 



DMA: Direct Memory Access. 
9 Benchmarking can be a difficult thing to do. On this system the philosophy was to create a 'real-world' SoC that 
estimates 'typical maximum' speeds and 'typical minimum' size. It's akin to the 'flight envelope' of an airplane. A 
flight envelope is a graph that shows the altitude vs. the speed of the aircraft. It's one 'benchmark' for the airplane. 
While the graph may show that the airplane is capable of flying at MACH 2.3 at an altitude of 28,000 meters, it may 
never actually fly in that situation during its lifetime. The graph is simply a tool for quickly understanding the engi- 
neering limits of the design. The same is true for the WISHBONE benchmarks given in this tutorial. However, 
having said this it is important to remember that the benchmarks are real systems, and do include all of the logic and 
routing resources needed to implement the design. 
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Figure A-17. Point-to-point interconnection example. 



This system was synthesized and routed on two styles of Xilinx 10 FPGA: the Spartan 2 and the 
Virtex 2. For benchmarking purposes the memories were altered so that they used Xilinx dis- 
tributed PvAMs instead of the register RAMs in MEM0002a. A memory interface for the Xilinx 
PvAMs can be found in MEMOOOla, which is substituted for MEM0002a. 

It should be noted that the Xilinx distributed RAMs are quite efficient on the WISHBONE inter- 
face. As can be seen in the source code, only a single 'AND' gate was needed to interface the 
RAM to WISHBONE. 

The system for the Xilinx Spartan 2 was synthesized and operated on a Silicore evaluation board. 
This was a 'reality check' that verified that things actually routed and worked as expected. Some 
of the common signals were brought out to test points on the evaluation board. These were 
monitored with an HP54620a logic analyzer to verify the operation. Figure A- 18 shows an ex- 
ample trace from the logic analyzer. Address lines, data write lines and several control signals 
were viewed. That Figure shows a write cycle with eight phases followed by a read cycle with 
eight phases. The data lines always show 0x67 because that's the data transferred by the DMA 
in this example. 



10 Xilinx is a registered trademark of Xilinx, Inc. 

WISHBONE SoC Architecture Specification, Revision B.2 



92 



Sampled @ 4.0g 



0.00s 5005/ 



Sngl Pat STOP 




g .CLK; 



Tr i g= " 15 xxxx xxxx xxxx' fxx^ 0 
Pr i n t Hardc opy I /□ 

Screen Menu Menu 



Ext x 



Serv i ce 
Menu 



Figure A-l 8. Logic analyzer trace on the Spartan 2 evaluation board 



Table A-5 shows the speed of the system after synthesis and routing. The Spartan 2 bench- 
marked at about 428 Mbyte/sec, and was tested in hardware (HW TEST). The Virtex 2 part was 
synthesized and routed, but was only tested under software (SW TEST). 



Table A-5. 32-bit Point-to-] 


Doint Interconnection Benchmark Results 


MFG & Type 


Part Number 


HW 
TEST 


SW 
TEST 


Size 


Timing 
Constraint 
(MIN) 


Actual 
Speed 
(MAX) 


Data Transfer 
Rate 
(MAX) 


Xilinx 
Spartan 2 
(FPGA) 


XC2S50-5-PQ208C 






53 SLICE 


100 MHz 


107 MHz 


428 Mbyte/sec 


Xilinx 
Virtex 2 
(FPGA) 


XC2V40-4-FG256C 




V 


53 SLICE 


200 MHz 


203 MHz 


812 Mbyte/sec 



Notes: 

VHDL synthesis tool: Altium Accolade PeakFPGA 5.30a 
Router: Xilinx Alliance 3.3.06I_V2_SE2 
Hardware evaluation board: Silicore 170101-00 Rev 1.0 
Listed size was reported by the router. 

Spartan 2 test used '-5' speed grade part (slower than the faster 4 -6' part). 



11 The logic analyzer samples at 500 Mhz, so the SoC was slowed down to make the traces look better. This trace 
was taken with a SoC clock speed of 5 MHz. Slowing the clock down is also a good way to verify that the speed of 
the WISHBONE interface can be 'throttled' up and down. 
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A.9 Shared Bus Example 

Now that we've built a WISHBONE point-to-point interconnection, it's time to look at a more 
complex SoC design. In this example, we'll create a 32-bit shared bus system with four MAS- 
TERS and four SLAVEs. Furthermore, we'll re-use the same DMA, memory and SYSCON in- 
terfaces that we used in the point-to-point interconnection example above. This will demonstrate 
how WISHBONE interfaces can be adapted to many different system topologies. 

This example will require the introduction of some new concepts. As the system integrator, 
we'll need to make some decisions about how we want our system to work. After that, we'll 
need to create the various parts of the design in order to finish the job. Some of the decisions 
and tasks include: 

• Choosing between multiplexed and non-multiplexed bus topology. 

• Choosing between three-state and multiplexor based interconnection logic. 

• Creating the interconnection topology. 

• Creating an address map (using variable address decoding). 

• Creating a four level round-robin arbiter. 

• Creating and benchmarking the system. 

A.9.1 Choosing Between Multiplexed and Non-multiplexed Bus Topology 

The first step in designing a shared bus is to determine how we'll move our data around the sys- 
tem. In this section we'll explore multiplexed and non-multiplexed buses, and explore some of 
the trade-offs between them. 

The big advantage of multiplexed buses is that they reduce the number of interconnections by 
routing different types of data over the same set of signal lines. The most common form of mul- 
tiplexed bus is where address and data lines share a common set of signals. A multiplexed bus is 
shown in Figure A- 19. For example, a 32-bit address bus and 32-bit data bus can be combined to 
form a 32-bit common address/data bus. This reduces the number of routed signals from 64 to 
32. 

The major disadvantage of the multiplexed bus is that it takes twice as long to move the infor- 
mation. In this case a non-multiplexed, synchronous bus can generally move address and data 
information in as little as one clock cycle. Multiplexed address and data buses require at least 
two clock cycles to move the same information. 

Since we're creating a benchmarking system that is optimized for speed, we'll use the non- 
multiplexed scheme for this example. This will allow us to move one data operand during every 
clock cycle. 

It should be noted that multiplexed buses have long been used in the electronics industry. In 
semiconductor chips the technique is used to reduced the number of pins on a chip. In the mi- 
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crocomputer bus industry the technique is often used to reduce the number of backplane con- 
nector pins. 
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Figure A-19. Circuit and timing diagram for a multiplexed address/data bus. 



A.9.2 Choosing Between Three-State and Multiplexor Interconnection Logic 

WISHBONE interconnections can use three-state 12 or multiplexor logic to move data around a 
SoC. The choice depends on what makes sense in the application, and what's available on the 
integrated circuit. 

Three-state I/O buffers have long been used in the semiconductor and microcomputer bus indus- 
tries. These reduce the number of signal pins on an interface. In microcomputer buses with 
master-slave architectures, the master that 'owns' the bus turns its buffers 'on\ while the other 
master(s) turn their buffers 'off 13 . This prevents more than one bus master from driving any 
signal line at any given time. A similar situation also occurs at the slave end. There, a slave that 
participates in a bus cycle enables its output buffers during read cycles. 

In WISHBONE, all IP core interfaces have ' in' and 'out' signals on the address, data and other 
internal buses. This allows the interface to be adapted to point-to-point, multiplexed and three- 
state I/O interconnections. Figure A-20 shows how the 'in' and 'out' signals can be connected to 
a three-state I/O bus 14 . 



12 Three-state buffers are sometimes called Tri-State® buffers. Tri-State® is a registered trademark of National 
Semiconductor Corporation. 

13 Here, 'on' and 'off refer to the three-state and non three-state conditions, respectively. 

14 Also note that the resistor/current source shown in the figure can also be a pull-down resistor or a bus terminator, 
or eliminated altogether. 
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Figure A-20. Connection of a data bus hit to a thr ee-state interconnection. 



A simple SoC interconnection that uses three-state I/O buffers is shown in the block diagram of 
Figure A-21(a). There, the data buses on two master and two slave modules are interconnected 
with three-state logic. However, this approach has two major drawbacks. First, it is inherently 
slower than direct interconnections. That's because there are always minimum timing parame- 
ters that must be met to turn buffers on-and-off. Second, many 1C devices do not have any inter- 
nal three-state routing resources available to them, or they are very restrictive in terms of loca- 
tion or quantity of these interconnects. 

As shown in Figure A-21(b), the SoC bus can be adapted to use multiplexor logic to achieve the 
same goal. The main advantage of this approach is that it does not use the three-state routing 
resources which are not available on many FPGA and ASIC devices. 

The main disadvantage of the multiplexor logic interconnection is that it requires a larger number 
of routed interconnects and logic gates (which are not required with the three-state bus ap- 
proach). 

However, there is also a growing body of evidence that suggests that this type of interconnection 
is easier to route in both FPGA and ASIC devices. Although this is very difficult to quantify, the 
author has found that the multiplexor logic interconnection is quite easily handled by standard 
FPGA and ASIC routers. This is because: 

• Three-state interconnections force router software to organize the SoC around the fixed 
three-state bus locations. In many cases, this constraint results in poorly optimized and/or 
slow circuits. 
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o Very often, 'bit locations' within a design are grouped together. In many applications, the 
multiplexor logic interconnection is easier to handle for place & route tools. 

o Pre-defined, external I/O pin locations are easier to achieve with multiplexor logic inter- 
connections. This is especially true with FPGA devices. 



For the shared bus example we will use multiplexor logic for the interconnection. That's be- 
cause multiplexor logic interconnections are more portable than three-state logic designs. The 
shared bus design in this example is intended to be used on many brands of FPGA and ASIC de- 
vices. 
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(B) MULTIPLEXOR LOGIC INTERCONNECTION 

Figure A-21. Three-state bus interconnection vs. multiplexor logic interconnection. 
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A.9.3 Creating the Interconnection Topology 

In the previous two sections it was decided to use multiplexor interconnections with non- 
multiplexed address and data buses. In this section we'll refine those concepts into a broad in- 
terconnection topology for our system. However, we'll save the details for later. For now, we re 
just interested in looking at some of the general issues. 

In WISHBONE nomenclature, the interconnection is also called the INTERCON. The INTER- 
CON is an IP Core that connects all of the MASTER, SLAVE and SYSCON cores together. 

Figure A-22 shows the generic topology of an INTERCON that supports multiplexor intercon- 
nections with multiplexed address and data buses. By 'generic', we mean that there are N 
MASTERS and SLAVEs shown in the diagram. The actual number of MASTER and SLAVE 
interfaces can be adjusted up or down, depending upon what's needed in the system. In the 
shared bus example we'll use four MASTERS and four SLAVEs. However, for now we 11 think 
in more general terms. 

An interface called the SYSCON provides the system with a stable clock [CLK_0] and reset 
signal [RST_0]. For now, we'll assume that the clock comes from off-chip, and that the reset 
signal is synchronized from some global system reset. 

After the initial system reset, one or more MASTERS request the interconnection, which we'll 
call a 'bus' for now. MASTERS do this by asserting their [CYC_0] signal. An arbiter, winch 
we'll discuss shortly, determines which MASTER can use the bus. One clock edge after the as- 
sertion of a ICYC O] signal the arbiter grants the bus to one of the MASTERS that requested it. 
It grants the bus by asserting grant lines ONTO - GNTN and GNT(N..O). This informs both the 
INTERCON as to which MASTER can own the bus. 

Once the bus is arbitrated, the output signals from the selected MASTER are routed, via multi- 
plexors, onto the shared buses. For example, if MASTER #0 obtains the bus, then the address 
lines [ADR 0()] from MASTER #0 are routed to shared bus [ADR()]. The same thing happens 
to the data out [DAT_0()], select out [SEL_0()], write enable [WE_0] and strobe [STB_0] sig- 
nals. The shared bus output signals are routed to the inputs on the SLAVE interfaces. 

The arbiter grant lines are also used to enable the terminating signals [ACKJ], [RTY_I] and 
TERR II These are enabled at the MASTER that acquired the bus. For example, if MASTER 
#0 is "granted the bus by the arbiter, then the [ACKJ], [RTY_I] and [ERRJ] are enabled at 
MASTER #0. Other MASTERS, who may also be requesting the bus, never receive a terminat- 
ing signal, and therefore will wait until they are granted the bus. 

During this interval the common address bus [ADR()] is driven with the address lines from the 
MASTER The address lines are decoded by the address comparator, which splits the address 
space into 'N' sections. The decoded output from the comparator is used to select the slave by 
way of its strobe input [STBJ]. A SLAVE may only respond to a bus cycle when its [STBJ] is 
asserted. This is also a wonderful illustration of the partial address decoding technique used by 
WISHBONE, which we'll discuss in depth below. 
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For example, consider a system with an addressing range of sixteen bits. If the addressing range 
were evenly split between all of the SLAVEs, then each SLAVE would be allocated 16 Kbytes 
of address space. This is shown in the address map of Figure A-23. In this case, the address 
comparator would decode bits [ADR(15..14)]. In actual practice the system integrator can alter 
the address map at his or her discretion. 

Once a SLAVE is selected, it participates in the current bus cycle generated by the MASTER. In 
response to the cycle, the SLAVE must assert either its [ACK_0], [RTY_0] or [ERR_0] output. 
These signals are collected with an 'or' gate, and routed to the current MASTER. 

Also note that during read cycles, the SLAVE places data on its [DAT_OQ] bus. These are 
routed from the participating SLAVE to the current MASTER by way of a multiplexor. In this 
case, the multiplexor source is selected by some address lines which have been appropriately se- 
lected to switch the multiplexor. 

Once the MASTER owning the bus has received an asserted terminating signal, it terminates the 
bus cycle by negating its strobe output [STB_0]. If the MASTER is in the middle of a block 
transfer cycle, it will generate another phase of the block transfer. If it's performing a SINGLE 
cycle, or if its at the end of a BLOCK cycle, the MASTER terminates the cycle by negating its 
[CYCJ3] signal. This informs the MASTER that it's done with the bus, and that it can re- 
arbitrate it. 
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Figure A-22. WISHBONF. shared bus with multiplexor interconne ctions. 
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Figure A-23. Address map example. 



A.9.4 Full vs. Partial Address Decoding 

The address comparitor in our INTERCON example is a good way to explain the concept of 
WISHBONE partial address decoding. 

Many systems, including standard microcomputer buses like PCI and VMEbus, usefull address 
decoding. Under that method, each slave module decodes the full address bus. For example, if a 
32-bit address bus is used, then each slave decodes all thirty-two address bits (or at least a large 
portion of them). 

SoC buses like WISHBONE use partial address decoding on slave modules. Under this method, 
each slave decodes only the range of addresses that it uses. For example, if the slave has only 
four registers, then the WISHBONE interface uses only two address bits. This technique has the 
following advantages: 

o It facilitates high speed address decoders, 

o It uses less redundant address decoding logic (i.e. fewer gates), 

o It supports variable address sizing (between zero and 64-bits). 

o It supports the variable interconnection scheme. 

o It gives the system integrator a lot of flexibility in defining the address map. 

For example, consider the serial I/O port (IP core) with the internal register set shown in Figure 
A-24(a). If full address decoding is used, then the IP core must include an address decoder to 
select the module. In this case, the decoder requires: 32 bits - 2 bits = 30 bits. In addition, the IP 
core would also contain logic to decode the lower two bits which are used to determine which 
I/O registers are selected. 

If partial address decoding is used, then the IP core need only decode the two lower address bits 
(2 = 4). The upper thirty bits are decoded by logic outside of the IP core. In this case the de- 
coder logic is shown in Figure A-24(b). 



WISHBONE SoC Architecture Specification, Revision B.2 



101 



Standard microcomputer buses always use the full address decoding technique. That's because 
the interconnection method does not allow the creation of any new signals on the interface. 
However, in WISHBONE this limitation does not exist. WISHBONE allows the system inte- 
grator to modify the interconnection logic and signal paths. 

One advantage of the partial address decoding technique is that the size of the address decoder 
(on the IP core) is minimized. This speeds up the interface, as decoder logic can be relatively 
slow. For example, a 30-bit full address decoder often requires at least 30 XOR gates, and a 30- 
input AND gate. 

Another advantage of the partial address decoding technique is that less decoder logic is re- 
quired. In many cases, only one 'coarse' address decoder is required. If full address decoding is 
used, then each IP core must include a redundant set of address decoders. 

Another advantage of the partial address decoding technique is that it supports variable address 
sizing. For example, on WISHBONE the address path can be any size between zero and 64-bits. 
Slave modules are designed to utilize only the block of addresses that are required. In this case, 
the full address decoding technique cannot be used because the IP core designer is unaware of 
the size of the system address path. 

Another advantage of the partial address decoding technique is that it supports the variable inter- 
connection scheme. There, the type of interconnection logic is unknown to the IP core designer. 
The interconnection scheme must adapt to the types of slave IP cores that are used. 

The major disadvantage of the partial address decoding technique is that the SoC integrator must 
define part of the address decoder logic for each IP core. This increases the effort to integrate 
the IP cores into the final SoC. 
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Figure A-24. WISHBONE partial address decoding technique. 
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A.9.5 The System Arbiter 

The system arbiter determines which MASTER can use the shared bus. The WISHBONE speci- 
fication allows a variety of arbiters to be used. However, in this example a four level round- 
robin arbiter is used. 

Round-robin arbiters give equal priority to all of the MASTERS. These arbiters grant the bus on 
a rotating basis much like the four position rotary switch shown in Figure A-25. When a MAS- 
TER relinquishes the bus (by negating its [CYC_0] signal), the switch is turned to the next posi- 
tion, and the bus is granted to the MASTER on that level. If a request is not pending on a certain 
level, the arbiter skips over that level and continues onto the next one. In this way all of the 
MASTERS are granted the bus on an equal basis. 



MASTER #0 
Q 



MASTER #3 O 



O MASTER #1 



O 

MASTER #2 

Figure A-25. Round-robin arbiters grant the bus on a rotating 
basis much like a rotary switch. 



Round-robin arbiters are popular in data acquisition systems where data is collected and placed 
into shared memory. Often these peripherals must off-load data to memory on an equal basis. 
Priority arbiters (where each MASTER is assigned a higher or lower level of priority) do not 
work well in these applications because some peripherals would receive more bus bandwidth 
than others, thereby causing data 'gridlock'. 

The arbiter used in this example can be found in the WISHBONE Public Doma in Library for 
VHDL . ARBOOOla is used for the example. 



A.9.6 Creating and Benchmarking the System 

The final task in our shared bus system example is to create and benchmark the entire system. 
The INTERCON in our example system is based on the generic shared bus topology that was 
described above. However, that system is fine tuned to give the exact features that we will need. 

The final system supports four DMAOOOla MASTERS, four MEM0002a memories (SLAVEs), a 
32-bit data bus, a five bit address bus, a single SYCOOOla system controller and a ARBOOOla 



WISHBONE SoC Architecture Specification, Revision B.2 



104 



four level round-robin arbiter. The resulting VHDL file can be found under ICN0002a in the 
WISHBONE Public Domain Library for VHDL . 

In this application, the round-arbiter was chosen because all of the MASTERS are DMA con- 
trollers. That means that all four MASTERS continuously vie for the bus. If a priority arbiter 
were used, then only the one or two highest priority MASTERS would ever get the bus. 

As we'll see shortly, the error and retry signals [ERR_I] and [RTYI] are not supported on the 
MASTER and SLAVE interfaces on our example system. That's perfectly okay because these 
signals are optional in the WISHBONE specification. We could have added these signals in 
there, but they would have been removed by synthesis and router logic minimization tools. 

Since all of the MASTERS and SLAVEs on this system have identical port sizes and granulari- 
ties, the select [SEL] interconnection has been omitted. This could have been added, but it 
wasn't needed. 

The INTERCON system includes a partial address decoder for the SLAVEs. This decoder cre- 
ates the system address space shown in Figure A-26. The final address map is shown in Table 
A-6. 
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Figure A-26. Address map used by the INTERCON example. 



Table A-6. Address spaces used by INTERCON. [ 
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Source code for this example can be found in the WISHBONE Public Domain Library for 
VHDL (in the EXAMPLES folder). The library also has detailed documentation for the library 
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modules, including detailed circuit descriptions and timing diagrams. The reader is encouraged 
to review and experiment with all of these files. 

This system was synthesized and routed on two styles of Xilinx 15 FPGA: the Spartan 2 and the 
Virtex 2. For benchmarking purposes the memories were altered so that they used Xilinx dis- 
tributed RAMs instead of the register RAMs in MEM0002a. A memory interface for the Xilinx 
RAMs can be found in MEMOOOla, which is substituted for MEM0002a. 

It should be noted that the Xilinx distributed RAMs are quite efficient on the WISHBONE inter- 
face. As can be seen in the source code, only a single 'AND' gate was needed to interface the 
RAM to WISHBONE. 

The system for the Xilinx Spartan 2 was synthesized and operated on a Silicore evaluation board. 
In order to verify that the system actually does run correctly, an HP54620a logic analyzer was 
connected to test pins on the board, and some of the signals were viewed. Figure A-27 shows the 
trace. Address lines, data write lines and several control signals are shown. 
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Figure A-27. Logic analyzer trace on the Spartan 2 evaluation board 



16 



Table A-7 shows the speed of the system after synthesis and routing. The Spartan 2 bench- 
marked at about 220 Mbyte/sec, and was tested in hardware (HW TEST). The Virtex 2 part was 
only synthesized and routed, and showed a maximum speed of about 404 Mbyte/sec (SW TEST). 



15 Xilinx is a registered trademark of Xilinx, Inc. 

16 The logic analyzer samples at 500 Mhz, so the SoC was slowed down to make the traces look better. This trace 
was taken with a SoC clock speed of 5 MHz. 
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Table A-7. 32-bit Shared Bus Interconnection Benchmark Results 
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Type 


Part Number 


HW 
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sw 
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Size 


Timing 
Constraint 
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Actual 
Speed 
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Data Transfer 
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Xilinx 
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(FPGA) 


XC2S50-5-PQ208C 


V 




356 SLICE 


55 MHz 


55MHz 


220 Mbyte/sec 


Xilinx 
Virtex 2 
(FPGA) 


XC2V250-5-FG256C 




V 


355 SLICE 


99 MHz 


101 MHz 


404 Mbyte/sec 



Notes: 

VHDL synthesis tool: Altium Accolade PeakFPGA 5.30a 
Router: Xilinx Alliance 3.3.06I_V2_SE2 
Hardware evaluation board: Silicore 170101-00 Rev 1.0 
Listed size was reported by the router. 

Spartan 2 test used '-5' speed grade part (slower than the faster '-6' part). 
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Altium unveils new "Board-on-Chip" technology 

Altium to preview industry-first technology at Programmable World that allows engineers to use 
board-level methodologies to design and implement embedded systems on FPGAs 

SYDNEY, Australia - April 28, 2003 - Altium Limited (ASX: ALU), a leading developer of Windows-based 
electronic design and development software, announced today that it will preview a new design 
technology that brings together the worlds of software and hardware design in a single, integrated design 
environment. This new technology enables engineers to use board-level design methodologies to both 
design and implement an entire digital system - including embedded microprocessor cores and the 
software that runs on them - onto an FPGA. Altium claims that this new technology gives engineers the 
power to literally start putting their board designs straight into FPGAs, a process that Altium calls 
"Board-on-Chip" (BoC) design. 

Despite the emergence of high capacity, low cost FPGAs that offer the potential to be used as a platform 
for digital system design, there are currently significant cost and technology barriers facing engineers who 
wish to exploit this potential. Current tools are primarily HDL based and not well integrated with 
embedded software tools. Also, HDL-based IP cores are expensive and have complex licensing 
schemes. To date, these barriers have hindered the penetration of FPGAs as a platform solution for 
mainstream engineers. 

Altium says that its new BoC technology will enable engineers to overcome these barriers and use 
familiar design methodologies to harness the power of FPGAs as a design platform. "We recognize the 
potential of FPGAs as a design platform but currently tools to allow engineers to fully exploit this potential 
are just not there/' says Nick Martin, Joint CEO and founder, Altium. "However, the technology that we 
are developing at Altium will allow us to deliver a system that enables rapid implementation and testing of 
complete digital systems on FPGAs - without the need for HDL source files or extensive synthesis and 
verification." 
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Board-on-Chip technology 



In order to facilitate BoC design, Altium has brought together technology from the full range of its EDA 
and embedded products. Altium's nVisage multi-dimensional design capture and TASKING embedded 
software technologies (including Viper multi-core compiler and debugger technologies) have been 
combined on the Design Explorer (DXP) platform to provide a single, integrated hardware design and 
software development environment. 

The BoC system that Altium will preview at Programmable World will include: 

• Mixed schematic/VHDL design capture 

• Integrated software development 

• Processor core packs that combine pre-synthesised processor cores with matching compiler, 
simulator and debugger 

• Schematic component libraries containing a range of pre-synthesised components including 
common peripheral devices, 74xxx series logic and a range of communication and interface 
components 

• Primitives and macro libraries for all Xilinx and Altera device families 

• 'Virtual' instruments such as logic analyzers and frequency counters that can be built into the 
design for test purposes 

• A BoC development board loaded with an FPGA that functions as a breadboard and allows 
implementation of the design directly from the engineer's PC onto the FPGA. 

"What makes this technology a real break-through is that it will provide a fully-integrated and self- 
contained system for FPGA-based design that will be accessible to the vast majority of engineers" says 
Martin. "This BoC technology fepresents a new EDA paradigm and has the potential to significantly lower 
development costs and provide huge benefits to engineers in terms of time to market and design 
flexibility" 

True parallel development of hardware and software 

One of the most significant benefits of Altium's BoC technology is that it allows a more flexible approach 
to partitioning the design between hardware and software - engineers retain the ability to choose a 
hardware or software solution to any particular problem throughout the design process. Also, the inclusion 
of a targeted development board as part of the BoC system allows hardware dependencies in the 
software to be addressed before the target hardware is finalized. 

In general software tends to remain fluid throughout the development process because it is easy to 
update and change. Hardware, on the other hand, tends to be locked down early in the process because 
of time and cost involved in each hardware iteration and the fact that the board needs to be available in 
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order to finalize software development. However, using the BoC system with its reconfigurable FPGA- 
based hardware platform, hardware updates can be made as easily as software updates. This makes it 
possible to continue to explore different hardware options throughout the design process and allows final 
hardware/software partitioning decisions to be delayed until late in the development process. 

Reducing development costs and time to market 

The more flexible and parallel design approach enabled by the BoC technology will dramatically reduce 
time to market compared to traditional linear process flows. Within the BoC system, a new prototype is 
effectively generated each time the design is downloaded to the development board. This means that 
systems can be more fully developed and tested before fabricating the first PCB prototype. Also, because 
more of the design is implemented inside the FPGA, including the processor, the final PCB design and 
fabrication will be simpler, leading to further cost and time savings. 

Further information 

Altium will be demonstrating its BoC system at the Programmable World forum which begins on May 6. 
For details go to www.xilinx.com/events/pw2003/index.htm . 

About Altium Limited 

Altium Limited (ASX: ALU), trading as Protel International Limited (ASX: PRI) prior to August 6, 2001, is a 
leading global developer and supplier of desktop Electronic Design Automation (EDA) and embedded 
software design tools for the Microsoft Windows environment. 

Since the company's foundation in 1985 and its release of the world's first Microsoft Windows-based 
EDA tool in 1991, Altium has continued to apply the most advanced software design methods to provide 
powerful, easy-to-use and affordable design software to engineers and electronics designers worldwide. 

Altium's current product brands include Protel, nVisage, P-CAD, TASKING, Accolade, CircuitMaker and 
CAMtastic. These products offer tailored solutions covering a range of hardware and software design 
processes. 

Altium is headquartered in Sydney, Australia, and operates a number of sales and support offices in 
Australia, the United States, Japan and Europe, as well as maintaining a large reseller network in all other 
major markets. More information about the company and its products and services may be obtained from 
our website at www.altium.com . 

Altium, Protel, DXP, Design Explorer, nVisage, P-CAD, TASKING, Accolade, CircuitMaker, CAMtastic, Situs and Topological 
Autorouting and their respective logos are trademarks or registered trademarks of Altium Limited or its subsidiaries. All other 
registered or unregistered trademarks referenced herein are the property of their respective owners, and no trademark rights to the 
same are claimed. 
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Altium introduces systems focus to FPGA design with Nexar 

Altium releases industry's first out -of -the -box design environment for putting entire 

embedded systems on FPGAs 

SYDNEY, Australia - November 17, 2003 - Altium Limited (ASX: ALU), a leading developer of Windows- 
based electronics design software, today announced Nexar, the industry's first product to provide a 
comprehensive, vendor-independent solution for system-level design on an FPGA platform. Derived from 
Altium's demonstrated Board-on-Chip technology, Nexar integrates hardware design tools, embedded 
software development tools, IP-based components, virtual instrumentation and a reconfigurable 
development board to allow mainstream engineers, even those without HDL experience, to interactively 
design and implement a complete embedded system inside an FPGA. 

The benefits Nexar brings to engineers include parallel design of hardware and software, greater flexibility 
in hardware/software design partitioning, an integrated, FPGA vendor-independent solution for putting 
entire embedded systems into FPGAs, and a "live", interactive design environment for system-on-FPGA 
development and debug. 

"Nexar is the first complete system-on-FPGA design environment built upon live' real-time, hands-on 
engineering that happens inside the physical hardware design space using an approach that maps 
directly to an engineer's existing knowledge of system-level design," says Nick Martin, Altium's founder 
and Joint CEO. "The Nexar environment will be delivered complete and ready to use, with design 
hardware, design software and IP all in one box. Nexar will unlock the potential of current and next- 
generation high-capacity, low-cost FPGAs for every engineer." 

While many engineers look to FPGA technology to provide higher levels of on-chip integration and a 
lower risk alternative to the cost and lead time of conventional ASICs, system-level design on an FPGA 
platform is a difficult exercise, particularly when it comes to bringing the processor into the FPGA. Nexar 
changes this by taking proven board-level system design methodologies and retargeting them for FPGA 
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architectures. Nexar also integrates hardware design and software development within a single 
environment to provide a total solution to systems design. 

The result is a revolutionary system-on-FPGA product that will allow risk-free chip-level systems 
integration, practical hardware/software co-design, a complete systems-level development environment 
for FPGA-based embedded design and introduces an interactive design methodology called LiveDesign 
that is accessible to mainstream engineers. 

Introducing a new IP delivery model 

At the system level, Nexar provides a schematic-based design methodology to define system 
connectivity. The reason being that g-aphical schematic-based capture is more efficient for connecting 
functional blocks than HDLs and allows complex systems to be created quickly at the component level. 

Schematic design is facilitated in Nexar by the inclusion of extensive libraries of royalty-free, pre- 
synthesized, pre-verified IP components, including a range of processor cores, that can be simply 
dropped onto the schematic and connected together to form the system hardware. This is analogous to 
the way designers currently work at the board-level with physical, 'ofMhe-shelf components. 

Nexar components are processed for a variety of target FPGA architectures from multiple vendors. This 
allows design portability between FPGA device families and ensures a flexible vendor-independent 
approach to FPGA design. Nexar automatically and transparently selects the correct component model 
for the target architecture during system synthesis. As a result, designs can be synthesised very quickly 
because the component IP does not need to be reprocessed during synthesis. 

Nexar's component system provides a novel and secure framework for FPGA IP delivery that avoids the 
security problems associated with supplying IP as HDL source code. 

Seeing inside the FPGA with virtual instrumentation 

Along with IP components, Nexar includes a library of IP-based virtual instruments such as logic 
analysers, frequency counters/generators and 10 monitors that can be incorporated into the design at the 
schematic level to facilitate system debugging. 

Like the IP components, the virtual instruments are supplied as pre-synthesized models that allow them 
to be used across FPGA target architectures. These instruments have on-screen front panels analogous 
to their physical counterparts to provide an intuitive way for engineers to examine the working of their 
circuit, and to 'see' inside the FPGA during the design process. 
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A reconfigurable hardware platform 



Integral to Nexar is a versatile FPGA-based development board called a NanoBoard that provides a 
reconfigurable platform for implementing and debugging the design. The NanoBoard is connected to the 
engineer's PC and uses JTAG-based communications to both download the design to the on-board 
FPGA, and to interact with processor cores and instruments in the design once it has been downloaded. 

Target FPGAs are housed on plug-in daughter boards to allow easy retargeting of designs. Multiple 
NanoBoards can be chained together to facilitate the design of complex multi-FPGA systems, and can 
accommodate the inclusion of end-user boards into the system for final production PCB testing and 
debugging. 

Integrated software development 

To make it easy to develop system software, Nexar includes a complete set of software development 
tools for all supplied processor cores. Using Altium's Viper reconfigurable compiler technology, Nexar 
provides high-quality code development and debugging that is fully integrated with the hardware 
development environment. 

Once the target design has been downloaded to the NanoBoard, all processors in the design can be 
controlled and debugged from within the Nexar system. This enables software development to take place 
directly on the target hardware from early in the design cycle, supporting parallel development of 
hardware and software. Hardware designers can download their designs to the NanoBoard for interactive 
debugging during development, and software designers can develop their program code directly on the 
'real' hardware from early in the design cycle. 

Because hardware can be updated with the same ease as the software, Nexar allows more flexible 
design partitioning. Implementing the design on the NanoBoard means a physical prototype isn't required 
to support completion of the debug process, delaying the need to finalize hardware until later in the 
development cycle. 

LiveDesign environment 

Nexar's interactive system design environment and ability to directly connect designers and developers 
with their designs allows engineers to adopt a very 'hands on' approach to the development process. 
Altium calls this design methodology LiveDesign. 

Unlike conventional electronics design flows, LiveDesign-enabled tools represent a new methodology for 
hardware/software system design flows. The LiveDesign electronics development platform supports real- 
time on-the-fly design and debug of a physical circuit and has the potential to significantly effect the way 
electronic systems and products are designed with benefits for design speed, flow, quality and cost. 
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LiveDesign also allows system implementation for debug purposes, minimizing the reliance on time- 
consuming system-level simulation required by other design flows. There is little time or cost penalty 
involved in multiple design iterations, leaving engineers free to try different design solutions without the 
need to physically manufacture prototype boards. 

Nexar benefits 

Nexar has wide applicability across the electronics industry, and will be particularly beneficial in industries 
such as automotive, industrial control, telecoms and datacoms infrastructure, and non-electronics 
consumer products such as whitegoods, where product value is relatively high, but market size doesn't 
allow conventional ASIC development. 

Nexar will provide immediate benefits to: 

o Engineers who routinely develop digital systems using off-the-shelf silicon devices - Nexar 
provides an intuitive method for migrating board-level systems into device-level designs; 

o FPGA designers looking for an easier way to embed soft processor cores into their designs - 
Nexar provides a reconfigurable development environment, the tools and IP to support complete 
systems implementation; 

© Hardware & software engineers designing low- to medium-complexity embedded microcontroller 
applications - Nexar integrates hardware and software development flows from the first line of 
code through board-level implementation. 

Nexar represents a new way of approaching digital design. In particular Nexar will provide: 

o Shorter embedded systems development cycles by enabling parallel design of hardware and 
software; 

o Greater flexibility in hardware software design partitioning; 

o An integrated solution for putting entire embedded systems into FPGAs; 

o The benefits of chip-level integration to all designers, regardless of their HDL knowledge and 
expertise; 

o A personal ASIC alternative to high-cost factory ASIC implementation; 

© An FPGA vendor-independent solution to systems design; 

© A "live", interactive environment for system-on-FPGA development and debug. 

Pricing and availability 

Nexar 2004 will be available for shipment during the first quarter of the 2)04 calendar year at an 
anticipated list price of US$7,995. For more product information, visit www.altium.com/nexar/ or contact 
your local Altium sales and support center. 
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About Altium Limited 



Altium Limited (ASX: ALU) is a global developer and supplier of electronics design software for the 
Microsoft Windows environment. Founded in 1985, Altium released the world's first Microsoft Windows- 
based printed circuit board design tool in 1991 and continues to provide advanced, easy-to-use and 
affordable software design tools to electronics engineers, designers, and developers worldwide. Altium's 
products offer tailored solutions covering a range of hardware and software design processes including 
the well-known Protel, P-CAD and TASKING brands. Altium is headquartered in Sydney, Australia and 
has sales and support offices in Australia, the United States, Japan and Europe. More information is 
available at www.altium.com . 

Altium, Nexar, LiveDesign, NanoBoard, Protel, DXP, Design Explorer, nVisage, PCAD, TASKING, Accolade, CircuitMaker, 
CAMtastic, Situs and Topological Autorouting and their respective logos are trademarks or registered trademarks of Altium Limited 
or its subsidiaries. All other registered or unregistered trademarks referenced herein are the property of their respective owners, and 
no trademark rights to the same are claimed. 
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